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SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM 

10 

This application is a continuation-in-part of provisional application Ser. No. 
60/080,110, filed March 31, 1998, which is incorporated by reference herein. 

15 

FIELD OF THE INVENTION 
The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

20 

BACKGROUND OF THE INVENTION 
Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning techniques 

2 5 clone novel polynucleotides "directly" in the sense that they rely on information directly 

related to the discovered protein (i.e., partial DNA/ amino acid sequence of the protein 
in the case of hybridization cloning; activity of the protein in the case of expression 
cloning). More recent "indirect" cloning techniques such as signal sequence cloning, which 
isolates DNA sequences based on the presence of a now well-recognized secretory leader 

3 0 sequence motif, as well as various PCR-based or low stringency hybridization cloning 

techniques, have advanced the state of the art by making available large numbers of 
DNA /amino acid sequences for proteins that are known to have biological activity by 
virtue of their secreted nature in the case of leader sequence cloning, or by virtue of the 
cell or tissue source in the case of PCR-based techniques. It is to these proteins and the 
3 5 polynucleotides encoding them that the present invention is directed. 
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SUMMARY OF THE INVENTION 
In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 216 to nucleotide 2174; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 339 to nucleotide 2174; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone cc359^4 deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone cc359_4 deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone cc359_4 deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone cc359„4 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:2; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i); and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i) and that has a length that is at least 
25% of the length of SEQ ID NO:l. 
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Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:l from nucleotide 216 to nucleotide 2174; the nucleotide sequence of SEQ ID NO:l 
from nucleotide 339 to nucleotide 2174; the nucleotide sequence of the full-length protein 
coding sequence of clone cc359__4 deposited under accession number ATCC 98715; or the 
nucleotide sequence of a mature protein coding sequence of clone cc359_4 deposited 
under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone cc359_4 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:2 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:2, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID MO:2 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 321 
to amino acid 330 of SEQ ID NO:2. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:L 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 

(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes that hybridize 
in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQ ID MO:l, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:l; and 

(ab) the nucleotide sequence of the cDNA insert of clone 
cc359_4 deposited under accession number ATCC 98715; 

(ii) ' hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

and 

(b) a process comprising the steps of: 



WO 99/50405 



PCT/US99/06946 



(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 

(ba) SEQ ID NO:l, but excluding the poly(A) tail at the 
5 3' end of SEQ ID NO:l ; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
cc359_4 deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

1 0 (iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:l, and extending 
contiguously from a nucleotide sequence corresponding to the 5' end of SEQ ID NO:l to 

15 a nucleotide sequence corresponding to the 3' end of SEQ ID NO:l , but excluding the 
poly(A) tail at the 3' end of SEQ ID NO:!. Also preferably the polynucleotide isolated 
according to the above process comprises a nucleotide sequence corresponding to the 
cDNA sequence of SEQ ID NO:l from nucleotide 216 to nucleotide 2174, and extending 
contiguously from a nucleotide sequence corresponding to the 5' end of said sequence of 

2 0 SEQ ID NO:l from nucleotide 216 to nucleotide 2174, to a nucleotide sequence 
corresponding to the 3' end of said sequence of SEQ ID NO:l from nucleotide 216 to 
nucleotide 2174. Also preferably the polynucleotide isolated according to the above 
process comprises a nucleotide sequence corresponding to the cDNA sequence of SEQ ID 
NO:l from nucleotide 339 to nucleotide 2174, and extending contiguously from a 

2 5 nucleotide sequence corresponding to the 5' end of said sequence of SEQ ID NO:l from 

nucleotide 339 to nucleotide 2174, to a nucleotide sequence corresponding to the 3' end 
of said sequence of SEQ ID NO:l from nucleotide 339 to nucleotide 2174. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 

3 0 consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) a fragment of the amino acid sequence of SEQ ID NO:2, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:2; and 
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(c) the amino acid sequence encoded by the cDNA insert of clone 

cc359_4 deposited under accession number ATCC 98715; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:2. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NO:2, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:2 having biological activity, the fragment comprising the amino acid sequence from 
amino acid 321 to amino acid 330 of SEQ ID NO:2. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 202 to nucleotide 414; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 370 to nucleotide 414; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone ct547_2 deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ct547_2 deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ct547J2 deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone ct547_J2 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; , 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:4; 
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(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; 

5 (1) a polynucleotide that hybridizes under stringent conditions to any 

one of the polynucleotides specified in (a)-(i); and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i) and that has a length that is at least 
25% of the length of SEQ ID NO:3. 
1 0 Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 

NO:3 from nucleotide 202 to nucleotide 414; the nucleotide sequence of SEQ ID NO:3 from 
nucleotide 370 to nucleotide 414; the nucleotide sequence of the full-length protein coding 
sequence of clone ct547_2 deposited under accession number ATCC 98715; or the 
nucleotide sequence of a mature protein coding sequence of clone ct547_2 deposited 
15 under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone ct547_2 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:4 having biological 
20 activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:4, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:4 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 30 
to amino acid 39 of SEQ ID NO:4. 

2 5 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

IDNO:3. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 
(a) a process comprising the steps of: 

3 0 (0 preparing one or more polynucleotide probes that hybridize 

in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQ ID NO:3, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:3; and 
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25 



(ab) the nucleotide sequence of the cDNA insert of clone 
ct547_2 deposited under accession number ATCC 98715; 

(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 



and 



(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
1 0 hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 

the group consisting of: 

(ba) SEQ ID NO:3, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:3; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
1 5 ct547_2 deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 

20 Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:3, and extending 
contiguously from a nucleotide sequence corresponding to the 5' end of SEQ ID NO:3 to 
a nucleotide sequence corresponding to the 3' end of SEQ ID NO:3 , but excluding the 
poly(A) tail at the 3' end of SEQ ID NO:3. Also preferably the polynucleotide isolated 
according to the above process comprises a nucleotide sequence corresponding to the 
cDNA sequence of SEQ ID NO:3 from nucleotide 202 to nucleotide 414, and extending 
contiguously from a nucleotide sequence corresponding to the 5* end of said sequence of 
SEQ ID NO:3 from nucleotide 202 to nucleotide 414, to a nucleotide sequence 
corresponding to the 3' end of said sequence of SEQ ID NO:3 from nucleotide 202 to 
3 0 nucleotide 414. Also preferably the polynucleotide isolated according to the above 
process comprises a nucleotide sequence corresponding to the cDNA sequence of SEQ ID 
NO:3 from nucleotide 370 to nucleotide 414, and extending contiguously from a 
nucleotide sequence corresponding to the 5' end of said sequence of SEQ ID MO:3 from 
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nucleotide 370 to nucleotide 414, to a nucleotide sequence corresponding to the 3* end of 
said sequence of SEQ ID NO:3 from nucleotide 370 to nucleotide 414. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
5 consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) a fragment of the amino acid sequence of SEQ ID NO:4, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:4; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
1 0 ct547_2 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:4. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity, the fragment preferably 

1 5 comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NO:4, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:4 having biological activity, the fragment comprising the amino acid sequence from 
amino acid 30 to amino acid 39 of SEQ ID NO:4. 

In one embodiment, the present invention provides a composition comprising an 

2 0 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 206 to nucleotide 415; 

25 (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5 from nucleotide 293 to nucleotide 415; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone en553_l deposited under accession 
number ATCC 98715; 

30 (e) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone en553_l deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone en553_l deposited under accession number 
ATCC 98715; 
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(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone en553_l deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:6; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:6 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:6; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i); and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i) and that has a length that is at least 
25% of the length of SEQ ID NO:5. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 206 to nucleotide 415; the nucleotide sequence of SEQ ID NO:5 from 
nucleotide 293 to nucleotide 415; the nucleotide sequence of the full-length protein coding 
sequence of clone en553_l deposited under accession number ATCC 98715; or the 
nucleotide sequence of a mature protein coding sequence of clone en553_l deposited 
under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone en553_l deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:6 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous arriino acids of SEQ ID NO:6, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:6 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 30 
to amino acid 39 of SEQ ID NO:6. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:5. 
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Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 

(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes that hybridize 
5 in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 

consisting of: 

(aa) SEQ ID NO:5, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:5; and 

(ab) the nucleotide sequence of the cDNA insert of clone 
1 0 en553_l deposited under accession number ATCC 98715; 

(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

15 and 

(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 

2 0 (ba) SEQ ID NO:5, but excluding the poly(A) tail at the 

3' end of SEQ ID NO:5; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
en553_l deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 

2 5 conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:5, and extending 

3 0 contiguously from a nucleotide sequence corresponding to the 5' end of SEQ ID NO:5 to 

a nucleotide sequence corresponding to the 3' end of SEQ ID NO:5 , but excluding the 
poly(A) tail at the 3' end of SEQ ID NO:5. Also preferably the polynucleotide isolated 
according to the above process comprises a nucleotide sequence corresponding to the 
cDNA sequence of SEQ ID NO:5 from nucleotide 206 to nucleotide 415, and extending 

10 
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contiguously from a nucleotide sequence corresponding to the 5' end of said sequence of 
SEQ ID NO:5 from nucleotide 206 to nucleotide 415, to a nucleotide sequence 
corresponding to the 3* end of said sequence of SEQ ID NO:5 from nucleotide 206 to 
nucleotide 415. Also preferably the polynucleotide isolated according to the above 
process comprises a nucleotide sequence corresponding to the cDNA sequence of SEQ ID 
NO:5 from nucleotide 293 to nucleotide 415, and extending contiguously from a 
nucleotide sequence corresponding to the 5 r end of said sequence of SEQ ID NO:5 from 
nucleotide 293 to nucleotide 415, to a nucleotide sequence corresponding to the 3' end of 
said sequence of SEQ ID NO:5 from nucleotide 293 to nucleotide 415. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) a fragment of the amino acid sequence of SEQ ID NO:6, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:6; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
en553_l deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 6. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
arnino acid sequence of SEQ ID NO:6 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) contiguous arnino acids 
of SEQ ID NO:6, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:6 having biological activity, the fragment comprising the amino acid sequence from 
amino acid 30 to arnino acid 39 of SEQ ID NO:6. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 120 to nucleotide 1202; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone nn296_2 deposited under accession 
number ATCC 98715; 
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(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone nn296_2 deposited under accession number ATCC 98715; 

(e) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone nn296_2 deposited under accession number 

5 ATCC 98715; 

(f) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone nn296_2 deposited under accession number ATCC 98715; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:8; 

10 Ch) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:8 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:8; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

15 © a polynucleotide which encodes a species homologue of the protein 

of (g) or (h) above ; 

(k) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(h); and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(h) and that has a length that is at least 
25% of the length of SEQ ID NO:7. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 120 to nucleotide 1202; the nucleotide sequence of the full-length 
protein coding sequence of clone nn296_2 deposited under accession number ATCC 
98715; or the nucleotide sequence of a mature protein coding sequence of clone nn296_2 
deposited under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone nn296_2 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:8 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:8, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:8 having 
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biological activity, the fragment comprising the amino acid sequence from amino acid 175 
to amino acid 184 of SEQ ID NO:8. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:7. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 

(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes that hybridize 
in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQ ID NO:7, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:7; and 

(ab) the nucleotide sequence of the cDNA insert of clone 
nn296_2 deposited under accession number ATCC 98715; 

(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

and 

(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 

(ba) SEQ ID NO:7, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:7; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
nn296_2 deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:7, and extending 
contiguously from a nucleotide sequence corresponding to the 5' end of SEQ ID NO:7 to 
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a nucleotide sequence corresponding to the 3' end of SEQ ID NO:7 , but excluding the 
poly(A) tail at the 3' end of SEQ ID NO:7. Also preferably the polynucleotide isolated 
according to the above process comprises a nucleotide sequence corresponding to the 
cDNA sequence of SEQ ID NO.-7 from nucleotide 120 to nucleotide 1202, and extending 
contiguously from a nucleotide sequence corresponding to the 5' end of said sequence of 
SEQ ID NO:7 from nucleotide 120 to nucleotide 1202, to a nucleotide sequence 
corresponding to the 3' end of said sequence of SEQ ID NO:7 from nucleotide 120 to 
nucleotide 1202. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: * 

(a) the amino acid sequence of SEQ ID NO:8; 

(b) a fragment of the amino acid sequence of SEQ ID NO:8, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:8; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
nn296_2 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:8. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:8 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NO:8, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:8 having biological activity, the fragment comprising the amino acid sequence from 
amino acid 175 to amino acid 184 of SEQ ID NO:8. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:9; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 597 to nucleotide 704; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone nq27_13 deposited under accession 
number ATCC 98715; 
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(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone nq27_13 deposited under accession number ATCC 98715; 

(e) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone nq27_13 deposited under accession number 

5 ATCC 98715; 

(f) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone nq27_13 deposited under accession number ATCC 98715; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

10 <M a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:10 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO.T0; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:32; 

15 0) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:32 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:32; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

20 (1 > a polynucleotide which encodes a species homologue of the protein 

°f(g)-(j) above; 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(j); and 

(n) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(j) and that has a length that is at least 
25% of the length of SEQ ID NO:9. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 597 to nucleotide 704; the nucleotide sequence of the full-length 
protein coding sequence of clone nq27_13 deposited under accession number ATCC 
98715; or the nucleotide sequence of a mature protein coding sequence of clone nq27_13 
deposited under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone nq27_13 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
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comprising a fragment of the amino acid sequence of SEQ ID NO:10 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:10, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NOrlO having 
biological activity, the fragment comprising the amino acid sequence from amino acid 13 
to amino acid 22 of SEQ ID NOrlO. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:9. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 

(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes thathybridize 
in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQ ID NO:9, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:9; and 

(ab) the nucleotide sequence of the cDNA insert of clone 
nq27_13 deposited under accession number ATCC 98715; 

(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

and 

(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 

(ba) SEQ ID NO:9, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:9; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
nq27_13 deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 
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10 



(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to thecDNA sequence of SEQIDNO:9, and extending 
contiguously from a nucleotide sequence corresponding to the 5' end of SEQ ID NO:9 to 
a nucleotide sequence corresponding to the 3 1 end of SEQ ID NO:9 , but excluding the 
poly(A) tail at the 3' end of SEQ ID NO:9. Also preferably the polynucleotide isolated 
according to the above process comprises a nucleotide sequence corresponding to the 
cDNA sequence of SEQ ID NO:9 from nucleotide 597 to nucleotide 704, and extending 
contiguously from a nucleotide sequence corresponding to the 5' end of said sequence of 
SEQ ID NO:9 from nucleotide 597 to nucleotide 704, to a nucleotide sequence 
corresponding to the 3' end of said sequence of SEQ ID NO:9 from nucleotide 597 to 
nucleotide 704. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
15 consisting of: 

(a) the amino acid sequence of SEQ ID NO:10; 

(b) a fragment of the amino acid sequence of SEQ ID NO:10, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:10; and 

(c) the amino acid sequence of SEQ ID NO:32; 

< d ) a fragment of the amino acid sequence of SEQ ID NO:32, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:32; and 

(e) the amino acid sequence encoded by the cDNA insert of clone 
nq27J3 deposited under accession number ATCC 98715; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:10. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
ammo acid sequence of SEQ ID NO: 10 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NOrlO, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:10 having biological activity,^ fragment comprising the amino acid sequence 
from amino acid 13 to amino acid 22 of SEQ ID NO:10. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
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(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:ll; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NOrll from nucleotide 44 to nucleotide 475; 

r> < c ) a polynucleotide comprising the nucleotide sequence of the full- 

length protein coding sequence of clone pk65_4 deposited under accession 
number ATCC 98715; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone pk65_4 deposited under accession number ATCC 98715; 
10 (e) a polynucleotide comprising the nucleotide sequence of a mature 

protein coding sequence of clone pk65_4 deposited under accession number 
ATCC 98715; 

(f) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone pk65„4 deposited under accession number ATCC 98715; 
15 (g) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:12; 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:12 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:12; 

20 ® a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 
of (g) or (h) above ; 

(k) a polynucleotide that hybridizes under stringent conditions to any 
2 5 one of the polynucleotides specified in (a)-(h); and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(h) and that has a length that is at least 
25% of the length of SEQ ID NO:ll. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
30 NO:l 1 from nucleotide 44 to nucleotide 475; the nucleotide sequence of the full-length 
protein coding sequence of clone pk65L4 deposited under accession number ATCC 98715; 
or the nucleotide sequence of a mature protein coding sequence of clone pk65_4 deposited 
under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
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of clone pk65_4 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:12 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
5 preferably thirty) contiguous amino acids of SEQ ID NTO:12, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:12 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 67 
to amino acid 76 of SEQ ID NO:12. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
10 IDNO:ll. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 

(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes that hybridize 
15 in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 

consisting of: 

(aa) SEQ ID NO:ll, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:ll; and 

(ab) the nucleotide sequence of the cDNA insert of clone 
2 0 pk65_4 deposited under accession number ATCC 98715; 

(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

25 and 

(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 
30 (ba) . SEQ ID NO:ll, but excluding the poly(A) tail at the 

3' end of SEQ ID NO:ll; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
pk65_4 deposited under accession number ATCC 98715; 
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(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:ll, and 
extending contiguously from a nucleotide sequence corresponding to the 5' end of SEQ 
ID NO:ll to a nucleotide sequence corresponding to the 3' end of SEQ ID NO:ll , but 
excluding the poly(A) tail at the 3' end of SEQ ID NO:ll. Also preferably the 
polynucleotide isolated according to the above process comprises a nucleotide sequence 
corresponding to the cDNA sequence of SEQ ID NO:ll from nucleotide 44 to nucleotide 
475, and extending contiguously from a nucleotide sequence corresponding to the 5' end 
of said sequence of SEQ ID NO:ll from nucleotide 44 to nucleotide 475, to a nucleotide 
sequence corresponding to the 3' end of said sequence of SEQ ID NO:ll from nucleotide 
44 to nucleotide 475. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:12; 

(b) a fragment of the amino acid sequence of SEQ ID NO:12, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:12; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pk65_4 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the ainino acid sequence of SEQ ID NO:12. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:12 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NO:12, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:12 having biological acuvity, v the fragment comprising the amino acid sequence 
from amino acid 67 to amino acid 76 of SEQ ID NO:12. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
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(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:13; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO;13 from nucleotide 285 to nucleotide 590; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 408 to nucleotide 590; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone pk855 J deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone pk855_l deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone pk855_l deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone pk855_l deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:14; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:14 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO: 14; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; 

(1) a polynucleotide that hybridizes under stringent conditions to any- 
one of the polynucleotides specified in (a)-(i); and 

(m) a polynucleotide that hybridizes under stringent conditions to any- 
one of the polynucleotides specified in (a)-(i) and that has a length that is at least 
25% of the length of SEQ ID JMO:13. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 285 to nucleotide 590; the nucleotide sequence of SEQ ID NO:13 
from nucleotide 408 to nucleotide 590; the nucleotide sequence of the full-length protein 
coding sequence of clone pk855_l deposited under accession number ATCC 98715; or the 
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nucleotide sequence of a mature protein coding sequence of clone pk855_l deposited 
under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone pk855_l deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:14 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:14, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:14 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 46 
to amino acid 55 of SEQ ID NO:14. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:13. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 

(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes that hybridize 
in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQ ID NO:13, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:13; and 

(ab) the nucleotide sequence of the cDNA insert of clone 
pk855_l deposited under accession number ATCC 98715; 

(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

and 

(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 

(ba) SEQ ID NO:13, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:13; and 
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(bb) the nucleotide sequence of the cDNA insert of clone 
pk855_l deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:13, and 
extending contiguously from a nucleotide sequence corresponding to the 5' end of SEQ 
ID NO:13 to a nucleotide sequence corresponding to the 3' end of SEQ ID NO:13 , but 
excluding the poly(A) tail at the 3' end of SEQ ID NO:13. Also preferably the 
polynucleotide isolated according to the above process comprises a nucleotide sequence 
corresponding to the cDNA sequence of SEQ ID NO:13 from nucleotide 285 to nucleotide 
590, and extending contiguously from a nucleotide sequence corresponding to the 5' end 
of said sequence of SEQ ID NO:13 from nucleotide 285 to nucleotide 590, to a nucleotide 
sequence corresponding to the 3* end of said sequence of SEQ ID NO:13 from nucleotide 
285 to nucleotide 590. Also preferably the polynucleotide isolated according to the above 
process comprises a nucleotide sequence corresponding to the cDNA sequence of SEQ ID 
NO:13 from nucleotide 408 to nucleotide 590, and extending contiguously from a 
nucleotide sequence corresponding to the 5' end of said sequence of SEQ ID NO:13 from 
nucleotide 408 to nucleotide 590, to a nucleotide sequence corresponding to the 3* end of 
said sequence of SEQ ID NO:13 from nucleotide 408 to nucleotide 590. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 14; 

(b) a fragment of the amino acid sequence of SEQ ID NO:14, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:14; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pk855_l deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:14. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:14 having biological activity, the fragment preferably 
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comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NO:14, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:14 having biological activity, the fragment comprising the amino acid sequence 
from amino acid 46 to amino acid 55 of SEQ ID NO:14. 
5 In one embodiment, the present invention provides a composition comprising an 

isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
1 0 NO:15 from nucleotide 193 to nucleotide 2223; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 1045 to nucleotide 2223; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone PL776J3 deposited under accession 

1 5 number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone PL776_6 deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone FL776_6 deposited under accession number 

2 0 ATCC 98715; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone PLT76_6 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:16; 

25 (0 a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:16 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:16; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

30 (k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i); and 
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(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i) and that has a length that is at least 
25% of the length of SEQ ID NO:15. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 193 to nucleotide 2223; the nucleotide sequence of SEQ ID NO:15 
from nucleotide 1045 to nucleotide 2223; the nucleotide sequence of the full-length 
protein coding sequence of clone FL776_6 deposited under accession number ATCC 
98715; or the nucleotide sequence of a mature protein coding sequence of clone PL776_6 
deposited under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone PL776__6 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:16 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:16, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:16 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 333 
to amino acid 342 of SEQ ID NO:16. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:15. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 
(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes that hybridize 
in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQIDNO:15;and 

(ab) the nucleotide sequence of the cDNA insert of clone 
PL776_6 deposited under accession number ATCC 98715; 

(ii) hybricyzing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

and 
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(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 

(ba) SEQIDNO:15;and 

(bb) the nucleotide sequence of the cDNA insert of clone 
PL776_6 deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDMA sequence of SEQ ID NO:15, and 
extending contiguously from a nucleotide sequence corresponding to the 5' end of SEQ 
ID NO:15 to a nucleotide sequence corresponding to the 3' end of SEQ ID NO:15 . Also 
preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:15 from 
nucleotide 193 to nucleotide 2223, and extending contiguously from a nucleotide sequence 
corresponding to the 5' end of said sequence of SEQ ID NO:15 from nucleotide 193 to 
nucleotide 2223, to a nucleotide sequence corresponding to the 3' end of said sequence of 
SEQ ID NO:15 from nucleotide 193 to nucleotide 2223. Also preferably the polynucleotide 
isolated according to the above process comprises a nucleotide sequence corresponding 
to the cDNA sequence of SEQ ID NO:15 from nucleotide 1045 to nucleotide 2223, and 
extending contiguously from a nucleotide sequence corresponding to the 5' end of said 
sequence of SEQ ID NO:15 from nucleotide 1045 to nucleotide 2223, to a nucleotide 
sequence corresponding to the 3' end of said sequence of SEQ ID NO:15 from nucleotide 
1045 to nucleotide 2223. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid Sequence of SEQ ID NO:16; 

(b) a fragment of the amino acid sequence of SEQ ID NO:16, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:16; and 
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(c) the amino acid sequence encoded by the cDNA insert of clone 

PL776_6 deposited under accession number ATCC 98715; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:16. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:16 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NO:16, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:16 having biological activity, the fragment comprising the amino acid sequence 
from amino acid 333 to amino acid 342 of SEQ ID NO:16. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:17; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 198 to nucleotide 1121; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 381 to nucleotide 1121; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone pm4_13 deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone pm4 J3 deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone pm4_13 deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone pm4_J3 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:18; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:18; 
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(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i); and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i) and that has a length that is at least 
25% of the length of SEQ ID NO:17. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 198 to nucleotide 1121; the nucleotide sequence of SEQ ID NO:17 
from nucleotide 381 to nucleotide 1121; the nucleotide sequence of the full-length protein 
coding sequence of clone pm4_13 deposited under accession number ATCC 98715; or the 
nucleotide sequence of a mature protein coding sequence of clone pm4_13 deposited 
under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone pm4_13 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:18 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:18, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:18 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 149 
to amino acid 158 of SEQ ID NO:18. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:17. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 
(a) a process comprising the steps of: 

(i) preparing one or more polynucleotide probes that hybridize 
in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQ ID NO:17, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:17; and 



28 



. - ....1 



WO 99/50405 PCT/US 99/06946 



(ab) the nucleotide sequence of the cDNA insert of clone 
pm4_13 deposited under accession number ATCC 98715; 
(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 
5 (iii) isolating the DNA polynucleotides detected with the 

probe(s); 

and 

(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
1 0 hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 

the group consisting of: 

(ba) SEQ ID NO:17, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:17; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
15 pm4J3 deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 

2 0 Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:17, and 
extending contiguously from a nucleotide sequence corresponding to the 5 f end of SEQ 
ID NO:17 to a nucleotide sequence corresponding to the 3* end of SEQ ID NO:17 , but 
excluding the poly(A) tail at the 3' end of SEQ ID NO:17. Also preferably the 

2 5 polynucleotide isolated according to the above process comprises a nucleotide sequence 

corresponding to the cDNA sequence of SEQ ID NO:17 from nucleotide 198 to nucleotide 
1121, and extending contiguously from a nucleotide sequence corresponding to the 5' end 
of said sequence of SEQ ID NO:17 from nucleotide 198 to nucleotide 1121, to a nucleotide 
sequence corresponding to the 3' end of said sequence of SEQ ID NO:17 from nucleotide 

3 0 198 to nucleotide 1121. Also preferably the polynucleotide isolated according to the above 

process comprises a nucleotide sequence corresponding to the cDNA sequence of SEQ ID 
NO:17 from nucleotide 381 to nucleotide 1121, and extending contiguously from a 
nucleotide sequence corresponding to the 5' end of said sequence of SEQ ID NO:17 from 
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nucleotide 381 to nucleotide 1121, to a nucleotide sequence corresponding to the 3* end 
of said sequence of SEQ ID NO:17 from nucleotide 381 to nucleotide 1121. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:18; 

(b) a fragment of the amino acid sequence of SEQ ID NO:18, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:18; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pm4_13 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:18. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferr bly thirty) contiguous amino acids 
of SEQ ID NO:18, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:18 having biological activity, the fragment comprising the amino acid sequence 
from amino acid 149 to amino acid 158 of SEQ ID NO:18. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:19; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 19 to nucleotide 1953; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone pt326_4 deposited under accession 
number ATCC 98715; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone pt326_4 deposited under accession number ATCC 98715; 

(e) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone pt326_4 deposited under accession number 
ATCC 98715; 

(f) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone pt326_4 deposited under accession number ATCC 98715; 
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(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity, the fragment 
comprising eight contiguous amino acids of SEQ ID NO:20; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 
of (g) or (h) above ; 

(k) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(h); and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(h) and that has a length that is at least 
25% of the length of SEQ ID NO:19. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 19 to nucleotide 1953; the nucleotide sequence of the full-length 
protein coding sequence of clone pt326^.4 deposited under accession number ATCC 98715; 
or the nucleotide sequence of a mature protein coding sequence of clone pt326_4 
deposited under accession number ATCC 98715. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone pt326_4 deposited under accession number ATCC 98715. In further preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising a fragment of the amino acid sequence of SEQ ID NO:20 having biological 
activity, the fragment preferably comprising eight (more preferably twenty, most 
preferably thirty) contiguous amino acids of SEQ ID NO:20, or a polynucleotide encoding 
a protein comprising a fragment of the amino acid sequence of SEQ ID NO:20 having 
biological activity, the fragment comprising the amino acid sequence from amino acid 317 
to amino acid 326 of SEQ ID NO:20. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:19. 

Further embodiments of the invention provide isolated polynucleotides produced 
according to a process selected from the group consisting of: 
(a) a process comprising the steps of: 
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(i) preparing one or more polynucleotide probes that hybridize 
in 6X SSC at 65 degrees C to a nucleotide sequence selected from the group 
consisting of: 

(aa) SEQ ID NO:19, but excluding the poly(A) tail at the 
3' end of SEQ ID NO:19; and 

(ab) the nucleotide sequence of the cDNA insert of clone 
pt326_4 deposited under accession number ATCC 98715; 

(ii) hybridizing said probe(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; and 

(iii) isolating the DNA polynucleotides detected with the 
probe(s); 

and 

(b) a process comprising the steps of: 

(i) preparing one or more polynucleotide primers that 
hybridize in 6X SSC at 65 degrees C to a nucleotide sequence selected from 
the group consisting of: 

(ba) SEQ ID NO:19, but excluding the poly(A) tail at the 
3' end of SEQ ID MO:19; and 

(bb) the nucleotide sequence of the cDNA insert of clone 
pt326_4 deposited under accession number ATCC 98715; 

(ii) hybridizing said primer(s) to human genomic DNA in 
conditions at least as stringent as 4X SSC at 50 degrees C; 

(iii) amplifying human DNA sequences; and 

(iv) isolating the polynucleotide products of step (b)(iii). 
Preferably the polynucleotide isolated according to the above process comprises a 
nucleotide sequence corresponding to the cDNA sequence of SEQ ID NO:19, and 
extending contiguously from a nucleotide sequence corresponding to the 5' end of SEQ 
ID NO:19 to a nucleotide sequence corresponding to the 3' end of SEQ ID NO:19 , but 
excluding the poly(A) tail at the 3' end of SEQ ID NO:19. Also preferably the 
polynucleotide isolated according to the above process comprises a nucleotide sequence 
corresponding to the cDNA sequence of SEQ ID NO:19 from nucleotide 19 to nucleotide 
1953, and extending contiguously from a nucleotide sequence corresponding to the 5' end 
of said sequence of SEQ ID NO:19 from nucleotide 19 to nucleotide 1953, to a nucleotide 
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sequence corresponding to the 3' end of said sequence of SEQ ID NO: 19 from nucleotide 
19 to nucleotide 1953. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:20; 

(b) a fragment of the amino acid sequence of SEQ ID NO:20, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:20; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pt326_4 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:20. In further preferred 
embodiments, the present invention provides a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity, the fragment preferably 
comprising eight (more preferably twenty, most preferably thirty) contiguous amino acids 
of SEQ ID NO:20, or a protein comprising a fragment of the amino acid sequence of SEQ 
ID NO:20 having biological activity, the fragment comprising the amino acid sequence 
from amino acid 317 to amino acid 326 of SEQ ID NO:20. 

In certain preferred embodiments, the polynucleotide is operably linked to an 
expression control sequence. The invention also provides a host cell, including bacterial, 
yeast, insect and mammalian cells, transformed with such polynucleotide compositions. 
Also provided by the present invention are organisms that have enhanced, reduced, or 
modified expression of the gene(s) corresponding to the polynucleotide sequences 
disclosed herein. 

Processes are also provided for producing a protein, which comprise: 

(a) growing a culture of the host cell transformed with such 
polynucleotide compositions in a suitable culture medium; and 

(b) purifying the protein from the culture. 

The protein produced according to such methods is also provided by the present 
invention. 

Protein compositions of the present invention may further comprise a 
pharmaceutical^ acceptable carrier. Compositions comprising an antibody which 
specifically reacts with such protein are also provided by the present invention. 
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Methods are also provided for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figures 1 A and IB are schematic representations of the pED6 and pNOTs vectors, 
respectively, used for deposit of clones disclosed herein. 

10 DETAILED DESCRIPTION 

ISOLATED PROTEINS AND POLYNUCLEOTIDES 

Nucleotide and amino acid sequences, as presently determined, are reported 
below for each clone and protein disclosed in the present application. The nucleotide 
sequence of each clone can readily be determined by sequencing of the deposited clone 

15 in accordance with known methods. The predicted amine acid sequence (both full-length 
and mature forms) can then be determined from such nucleotide sequence. The arnino 
acid sequence of the protein encoded by a particular clone can also be deteraiined by 
expression of the clone in a suitable host cell, coUectrng the protein and detennining its 
sequence. For each disclosed protein applicants have identified what they have 

2 0 deterrnined to be the reading frame best identifiable with sequence information available 
at the time of filing. 

As used herein a "secreted" protein is one which, when expressed in a suitable host 
cell, is transported across or through a membrane, including transport as a result of signal 
sequences in its amino acid sequence. "Secreted" proteins include without limitation 
2 5 proteins secreted wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without limitation proteins 
which are transported across the membrane of the endoplasmic reticulum. 

Clone M cc359 4" 

30 A polynucleotide of the present invention has been identified as clone H cc359_4". 

cc359_4 was isolated from a human ddult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. cc359__4 is a full-length clone, 
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including the entire coding sequence of a secreted protein (also referred to herein as 
M cc359_4 protein"). 

The nucleotide sequence of cc359_4 as presently determined is reported in SEQ ID 
NO:l, and includes a poly(A) tail. What applicants presently believe to be the proper 
5 reading frame and the predicted amino acid sequence of the cc359_4 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:2. Amino 
acids 29 to 41 of SEQ ID NO:2 are a predicted leader/signal sequence, with the predicted 
mature amino acid sequence beginning at amino acid 42. Due to the hydrophobic nature 
of the predicted leader/signal sequence, it is likely to act as a transmembrane domain 
1 0 should the predicted leader/signal sequence not be separated from die remainder of the 
cc359_4 protein. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
cc359_4 should be approximately 2200 bp. 

The nucleotide sequence disclosed herein for cc359_4 was searched against the 

15 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. cc359_4 demonstrated at least some similarity with sequences 
identified as H23117 (ym51gl2.sl Homo sapiens cDNA clone 51970 3'). Based upon 
sequence similarity, cc359_4 proteins and each similar protein or peptide may share at 
least some activity. The cc359_4 protein sequence contains four chaoptin domains (at 

2 0 amino acids 94-115, 118-139, 142-163, and 261-282 of SEQ ID NO:2). Chaoptin is a cell- 
surface glycoprotein required for Drosophila photoreceptor cell morphogenesis. Chaoptin 
is largely composed of 41 potentially amphipathic repeats. Chaoptin repeats that have 
been reported in both yeast and human proteins are remarkably similar, suggesting their 
general importance as a structural and /or functional motif. Additionally, the cc359_4 

2 5 protein was found to contain a leucine zipper motif. The TopPredll computer program 

predicts five potential transmembrane domains within the cc359_4 protein sequence, 
centered around amino acids 20, 410, 490, 530, and 590 of SEQ ID NO:2, respectively. 

Clone "ct547 2" 

3 0 A polynucleotide of the present invention has been identified as clone "ct547_2 n . 

ct547_2 was isolated from a human aciult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the : :oded protein. ct547_2 is a full-length clone, 
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including the entire coding sequence of a secreted protein (also referred to herein as 
M ct547_2 protein"). 

The nucleotide sequence of ct547_2 as presently determined is reported in SEQ ID 
NO:3, and includes a poly(A) tail. What applicants presently believe to be the proper 
reading frame and the predicted amino acid sequence of the ct547_2 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:4. Amino 
acids 44 to 56 of SEQ ID NO:4 are a predicted leader/signal sequence, with the predicted 
mature amino acid sequence beginning at amino acid 57. Due to the hydrophobic nature 
of the predicted leader/signal sequence, it is likely to act as a transmembrane domain 
should the predicted leader/signal sequence not be separated from the remainder of the 
ct547__2 protein. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
ct547_2 should be approximately 1600 bp. 

The nucleotide sequence disclosed herein for ct547_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. ct547J2 demonstrated at least some similarity with sequences 
identified as AA428546 (zw47d03.rl Soares total fetus Nb2HF8 9w Homo sapiens cDNA 
clone 773189 5% R20032 (yg31g04.rl Homo sapiens cDNA clone 34069 5'), R27350 
(yh53cl2.rl Homo sapiens cDNA clone 133462 5'), and R66971 (yi29el2.rl Homo sapiens 
cDNA clone 140686 5*)* Based upon sequence similarity, ct547_2 proteins and each similar 
protein or peptide may share at least some activity. 

Clone "en553 1" 

A polynucleotide of the present invention has been identified as clone M en553_l" . 
en553_l was isolated from a human fetal brain cDNA library and was identified as 
encoding a secreted or transmembrane protein on the basis of computer analysis of the 
amino acid sequence of the encoded protein. en553_l is a full-length clone, including the 
entire coding sequence of a secreted protein (also referred to herein as "en553_l protein"). 

The nucleotide sequence of en553_l as presently determined is reported in SEQ 
ID NO:5, and includes a poly(A) tail. What applicants presently believe to be the proper 
reading frame and the predicted amino acid sequence of the en553_l protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:6. Amino 
acids 17 to 29 of SEQ ID NO:6 are a predicted leader/signal sequence, with the predicted 
mature amino acid sequence beginning at arrdno acid 30. Due to the hydrophobic nature 
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of the predicted leader/signal sequence, it is likely to act as a transmembrane domain 
should the predicted leader/signal sequence not be separated from the remainder of the 
en553_l protein. Another potential en553_l reading frame and predicted amino acid 
sequence is encoded by basepairs 1334 to 1597 of SEQ ID NO:5 and is reported in SEQ ID 
NO:31 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
en553_l should be approximately 2675 bp. 

The nucleotide sequence disclosed herein for en553_J was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BL ASTX and 
FASTA search protocols. en553 J demonstrated at least some similarity with sequences 
identified as No hits were found in the databases. The nucleotide sequence of en553 JL 
indicates that it may contain one or more copies of the f oUowing repetitive elements: Alu, 
MIR. 

Clone "nn296 2" 

A polynucleotide of the present invention has been identified as clone "nn296_2 , \ 
nn296_2 was isolated from a human fetal kidney (293 cell line) cDNA library using 
methods which are selective for cDNAs encoding secreted proteins (see U.S. Pat No. 
5,536,637), or was identified as encoding a secreted or transmembrane protein on the basis 
of computer analysis of the amino acid sequence of the encoded protein. nn296JZ is a full- 
length clone, including the entire coding sequence of a secreted protein (also referred to 
herein as M nn296_2 protein"). 

The nucleotide sequence of nn296JZ as presently determined is reported in SEQ 
ID NO:7, and includes a poly(A) tail. What applicants presently believe to be the proper 
reading frame and the predicted amino acid sequence of the nn296_2 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:8. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
nn296J2 should be approximately 4500 bp. 

The nucleotide sequence disclosed herein for nn296_j2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
FASTA search protocols. nn296_2 demonstrated at least some similarity with sequences 
identified as AA054538 (zk83e02.sl Soares pregnant uterus NbHPU Homo sapiens cDNA 
clone 489434 3'), AA175407 (ms80c06.rl Soares mouse 3NbMS Mus musculus cDNA clone 
617866 5' similar to WP R08C7.2 CE07425), AA510586 (vg33c05.rl Soares mouse 
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mammary gland NbMMG Mus musculus cDNA clone 863144 5' similar to WP:R08C7.2 
CE07425), C18077 (Human placenta cDNA 5'-end GEN-557B03), T24416 (Human gene 
signature HUMGS06449), and W74238 (zd02f09.rl Pancreatic Islet Homo sapiens cDNA 
clone 339497 5'). The predicted arriino acid sequence disclosed herein for nn296_2 was 
searched against the GenPept and GeneSeq amino acid sequence databases using the 
BLASTX search protocol. The predicted nn296j2 protein demonstrated at least some 
similarity to sequences identified as U61953 (cosmid R08C7 [Caenorhabditis elegans]). 
Based upon sequence similarity, nn296_2 proteins and each similar protein or peptide 
may share at least some activity. The TopPredll computer program predicts two potential 
transmembrane domains within the nn296_2 protein sequence, one centered around 
amino acid 50 and another around amino acid 340 of SEQ ID NO:8. The nucleotide 
sequence of nn296_2 indicates that it may contain at least one repetitive element. 

Clone "nq27 13" 

A polynucleotide of the present invention has been identified as clone "nq27_13". 
nq27_13 was isolated from a human adult blood (erythroleukemia TF-1) cDNA library 
using methods which are selective for cDNAs encoding secreted proteins (see tLS. Pat. 
No. 5,536,637), or was identified as encoding a secreted or transmembrane protein on the 
basis of computer analysis of the amino acid sequence of the encoded protein. nq27_13 
is a full-length clone, including the entire coding sequence of a secreted protein (also 
referred to herein as "nq27JL3 protein"). 

The nucleotide sequence of nq27_13 as presently determined is reported in SEQ 
ID NO:9, and includes a poly(A) tail. What applicants presently believe to be the proper 
reading frame and the predicted amino acid sequence of the nq27_13 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:10. 
Another potential nq27_13 reading frame and predicted amino acid sequence is encoded 
by basepairs 371 to 616 of SEQ ID NO:9 and is reported in SEQ ID NO:32. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
nq27_13 should be approximately 900 bp. 

The nucleotide sequence disclosed herein for nq27_13 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
FASTA search protocols. nq27_13 demonstrated at least some similarity with sequences 
identified as AA015599 (ze20hl0.sl Soares fetal heart NbHH19W Homo sapiens cDNA 
clone 359587 3'), AA465112 (aa32c05.rl NCI_CGAP_GCB1 Homo sapiens cDNA clone 
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IMAGE:814952 5'), AA628397 (a£26al2.sl Soares total fetus Mb2HF8 9w Homo sapiens 
cDNA clone 1032766 3*), AA628438 (af26el2.sl Soares total fetus Nb2HF8 9w Homo 
sapiens cDNA clone 1032814 3'), D79280 (Human aorta cDNA 5-end GEN-213H05), and 
H81836 (ys68gll.rl Homo sapiens cDNA clone 220004 5'). Based upon sequence 
similarity, nq27_13 proteins and each similar protein or peptide may share at least some 
activity. The TopPredE computer program predicts a potential transmembrane domain 
at the N-terrninus of the nq27_13 protein sequence, centered around amino acid 13 of SEQ 
ID NO:10. 

nq27_13 protein was expressed in a COS cell expression system, and an expressed 
protein band of approximately 11 kDa was detected in membrane fractions using SDS 
polyacrylamide gel electrophoresis. 

Clone "pk65 4" 

A polynucleotide of the present invention has been identified as clone "pk65_4 H . 
pk65_4 was isolated from a human fetal kidney (293 cell line) cDNA library using 
methods which are selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 
5,536,637), or was identified as encoding a secreted or transmembrane protein on the basis 
of computer analysis of the amino acid sequence of the encoded protein. pk65_4 is a full- 
length clone, including the entire coding sequence of a secreted protein (also referred to 
herein as "pk65_4 protein"). 

The nucleotide sequence of pk65_4 as presently determined is reported in SEQ ID 
NO:ll, and includes a poly(A) tail. What applicants presently believe to be the proper 
reading frame and the predicted amino acid sequence of the pk65_4 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:12. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
pk65_4 should be approximately 1500 bp. 

The nucleotide sequence disclosed herein for pk65_4 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
FASTA search protocols. pk65_4 demonstrated at least some similarity with sequences 
identified as AA309041 (EST179822 (Colon carcinoma (Caco-2) cell line I Homo sapiens 
cDNA 5' end), AA316187 (EST187903 fetCC cell line (matastasis to liver in mouse) H Homo 
sapiens cDNA 5' end), AA488843 (aa55al0.sl NCI_CGAP_GCB1 Homo sapiens cDNA 
clone IMAGE:824826 3'), AF022811 (Mus musculus cornichon mRNA, complete cds), 
D16980 (Human HepG2 partial cDNA, clone hmd2a04m5), N44081 (yy32a02.rl Homo 
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sapiens cDNA clone 272906 5* similar to PIR A56724 A56724 cni protein - fruit fly). The 
predicted amino acid sequence disclosed herein for pk65_4 was searched against the 
GenPept and GeneSeq amino acid sequence databases using the BLASTX search protocol. 
The predicted pk65_4 protein demonstrated at least some similarity to sequences 
5 identified as AB006191 (cornichon-like protein [Mus musculus)), AF0228U (cornichon 
[Mus musculus]), and U28069 (cni gene product [Drosophila melanogaster]). Based upon 
sequence similarity, pk65_4 proteins and each similar protein or peptide may share at 
least some activity. The TopPredll computer program predicts three potential 
transmembrane domains within the pk65_4 protein sequence, centered around amino 
1 0 acids 16, 67, and 133 of SEQ ID NO: 12, respectively. 

pk65_4 protein was expressed in a COS cell expression system, and an expressed 
protein band of approximately 15 kDa was detected in membrane fractions using SDS 
polyacrylamide gel electrophoresis. 

15 Clone "pk855 1" 

A polynucleotide of the present invention has been identified as clone "pk855_l", 
pk855_l was isolated from a human fetal kidney (293 cell line) cDNA library using 
methods which are selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 
5,536,637), or was identified as encoding a secreted or transmembrane protein on the basis 

20 of computer analysis of the amino acid sequence of the encoded protein. pk855_l is a full- 
length clone, including the entire coding sequence of a secreted protein (also referred to 
herein as "pk855_l protein"). 

The nucleotide sequence of pk855_l as presently determined is reported in SEQ 
ID NO:13, and includes a poly(A) tail. What applicants presently believe to be the proper 

25 reading frame and the predicted amino acid sequence of the pk855_l protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:14. Amino 
acids 29 to 41 of SEQ ID NO:14 are a predicted leader/signal sequence, with the predicted 
mature amino acid sequence beginning at amino acid 42. Due to the hydrophobic nature 
of the predicted leader/signal sequence, it is likely to act as a transmembrane domain 

3 0 should the predicted leader/signal sequence not be separated from the remainder of the 
pk855_l protein. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
pk855 JL should be approximately 1400 bp. 
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The nucleotide sequence disclosed herein for pk855_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
FASTA search protocols. pk855_l demonstrated at least some similarity with sequences 
identified as AA236320 (zr53h03,rl Soares NhHMPu SI Homo sapiens cDNA clone 
667157 5'), AA491296 (aa53d03.sl NCI_CGAP_GCB1 Homo sapiens cDNA clone 
IMAGE:824645 3'), and W76249 (zd66e06.rl Soares fetal heart NbHH19W Homo sapiens 
cDNA clone 345634 5*). Based upon sequence similarity, pk855__l proteins and each 
similar protein or peptide may share at least some activity. The TopPredE computer 
program predicts an additional potential transmembrane domain within the pk855_l 
protein sequence centered around amino acid 78 of SEQ ID NO: 14. The nucleotide 
sequence of pk855_l indicates that it may contain some repeat elements (including some 
short stretches of poly(T)). 

Clone TL776 6" 

A polynucleotide of the present invention has been identified as clone TL776_6". 
PL776_6 was isolated from a human fetal kidney (293 cell line) cDNA library using 
methods which are selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 
5,536,637), or was identified as encoding a secreted or transmembrane protein on the basis 
of computer analysis of the amino acid sequence of the encoded protein. PL776_6 is a full- 
length clone, including the entire coding sequence of a secreted protein (also referred to 
herein as "PL776_6 protein"). 

The nucleotide sequence of PL776_6 as presently determined is reported in SEQ 
ID NO:15, and includes a poly(A) tail. What applicants presently believe to be the proper 
reading frame and the predicted amino acid sequence of the PL776_6 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:16. Amino 
acids 272 to 284 of SEQ ID NO:16 are a predicted leader/signal sequence, with the 
predicted mature amino acid sequence beginning at amino acid 285. Due to the 
hydrophobic nature of the predicted leader/signal sequence, it is likely to act as a 
transmembrane domain should the predicted leader/signal sequence not be separated 
from the remainder of the PL776j6 protein. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
PL776_6 should be approximately 3600 bp. 

The nucleotide sequence disclosed herein for PL776__6 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
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FASTA search protocols. ?U776J> demonstrated at least some similarity with sequences 
identified as AA009770 (zi04c06.sl Soares fetal liver spleen 1NFLS SI Homo sapiens 
cDNA clone 429802 3'), AA056343 (zl66f04.sl Stratagene colon (#937204) Homo sapiens 
cDNA clone 509599 3'), AA076433 (zml9g07.sl Stratagene pancreas (#937208) Homo 
5 sapiens r.DNA clone 526140 3% AA076517 (zml9g07.rl Stratagene pancreas (#937208) 
Homo sapiens cDNA clone 526140 5'), M93661 (Rat notch 2 mRNA), and R36803 (CBS-216 
Homo sapiens cDNA clone CBS-216 5' end). Based upon sequence similarity, PL776_6 
proteins and each similar protein or peptide may share at least some activity. 

10 Clone "pm4 13" 

A polynucleotide of the present invention has been identified as clone "pm4 JL3\ 
pm4_13 was isolated from a human fetal kidney (293 cell line) cDNA library using 
methods which are selective for cDNAs encoding secreted proteins (see U.S. Pat No. 
5,536,637), or was identified as encoding a secreted or transmembrane protein on the basis 

15 of computer analysis of the amino acid sequence of the encoded protein. pm4_13 is a full- 
length clone, including the entire coding sequence of a secreted protein (also referred to 
herein as "pm4_13 protein"). 

The nucleotide sequence of pm4_13 as presently determined is reported in SEQ 
ID NO:17, and includes a poly(A) tail. What applicants presently believe to be the proper 

2 0 reading frame and the predicted amino acid sequence of the pm4_13 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID MO:18. Arrtino 
acids 49 to 61 of SEQ ID NO:18 are a predicted leader /signal sequence, with the predicted 
mature amino acid sequence beginning at amino acid 62. Due to the hydrophobic nature 
of the predicted leader/signal sequence, it is likely to act as a transmembrane domain 

2 5 should the predicted leader/signal sequence not be separated from the remainder of the 

pm4__13 protein. 

The EcoRI/Notl restriction fragment obtainable from the deposit containing clone 
pm4_13 should be approximately 3100 bp. 

The nucleotide sequence disclosed herein for pm4_13 was searched against the 

3 0 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 

FASTA search protocols. pm4_13 demonstrated at least some similarity with sequences 
identified as H98853 (yxl5dl0.sl Homo sapiens cDNA clone 261811 3'), R43710 
(ygl9fl2.sl Homo sapiens cDNA clone 32692 3'), and T22962 (Human gene signature 
HUMGS04687). The predicted amino acid sequence disclosed herein for pm4JL3 was 
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searched against the GenPept and GeneSeq amino acid sequence databases using the 
BLASTX search protocol The predicted pm4_13 protein demonstrated at least some 
similarity to sequences identified as U20865 (ylr246wp [Saccharomyces cerevisiae]), 
Z29121 (ZK757.1 [Caenorhabditis elegans]), Z46259 (N0325 gene product [Saccharomyces 
5 cerevisiae]), Z68507 (ml8.8 [Caenorhabditis elegans]), and Z71602 (orf ynl326c [Saccharo- 
myces cerevisiae]). Based upon sequence similarity, pm4_13 proteins and each similar 
protein or peptide may share at least some activity. The TopPredH computer program 
predicts three additional potential transmembrane domains within the pm4„13 protein 
sequence, centered around amino acids 86, 184, and 225 of SEQ ID NO:18, respectively. 

Clone "pt326 4" 

A polynucleotide of the present invention has been identified as clone n pt326jr. 
pt326_4 was isolated from a human adult blood (lymphoblastic leukemia MOLT-4) cDNA 
library using methods which are selective for cDNAs encoding secreted proteins (see U.S. 
Pat No. 5,536,637), or was identified as encoding a secreted or transmembrane protein on 
the basis of computer analysis of the amino acid sequence of the encoded protein. pt326_4 
is a full-length clone, including the entire coding sequence of a secreted protein (also 
referred to herein as "pt326_4 protein"). 

The nucleotide sequence of pt326_4 as presently determined is reported in SEQ ID 
NO:19, and includes a poly(A) tail. What applicants presently believe to be the proper 
reading frame and the predicted amino acid sequence of the pt326_4 protein 
corresponding to the foregoing nucleotide sequence is reported in SEQ ID NO:20. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
pt326_4 should be approximately 3100 bp. 

The nucleotide sequence disclosed herein for pt326_4 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
FASTA search protocols. pt326_4 demonstrated at least some similarity with sequences 
identified as AA361324 (EST70660 T-cell lymphoma Homo sapiens cDNA 5' end), H94651 
(yvl9cl2.sl Homo sapiens cDNA clone 243190 3'), T22309 (Human gene signature 
HUMGS03882), and U17907 (Human chromosome 17q21 mRNA clone B169). The 
predicted amino acid sequence disclosed herein for pt326_4 was searched against the 
GenPept and GeneSeq amino acid sequence databases using the BLASTX search protocol. 
The predicted pt326_4 protein demonstrated at least some similarity to sequences 
identified as D89646 (GTBP-ALT [Homo sapiens]) and U28946 (G/T mismatch binding 
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protein (GTBP) [Homo sapiens]). Based upon sequence similarity, pt326_4 proteins and 
each similar protein or peptide may share at least some activity. 

Deposit of Clones 

Clones cc359A ct547_2, en553_l, nn296_2, nq27_13, pk65„4, pk855_l, PL776_6, 
pm4_13, and pt326_4 were deposited on March 31, 1998 with the American Type Culture 
Collection (10801 University Boulevard, Manassas, Virginia 20110-2209 U.S.A.) as an 
original deposit under the Budapest Treaty and were given the accession number ATCC 
98715, from which each clone comprising a particular polynucleotide is obtainable. All 
restrictions on the availability to the public of the deposited material will be irrevocably 
removed upon the granting of the patent, except for the requirements specified in 37 
C.F.R. § 1.808(b), and the term of the deposit will comply with 37 C.RR. § 1.806. 

Each clone has been transfected into separate bacterial cells (E. coli) in this 
composite deposit. Each clone can be removed from the vector in which it was deposited 
by performing an EcoRI/NotI digestion (5' site, EcoRI; 3' site, NotI) to produce the 
appropriate fragment for such clone. Each clone was deposited in either the pED6 or 
pNTOTs vector depicted in Figures 1A and IB, respectively. The pED6dpc2 vector 
("pED6") was derived from pED6dpcl by insertion of a new polylinker to facilitate 
cDNA cloning (Kaufman et al, 1991, Nucleic Acids Res. 19: 4485-4490); the pNOTs vector 
was derived from pMT2 (Kaufman et al, 1989, Mol Cell Biol 9: 946-958) by deletion of 
the DHFR sequences, insertion of a new polylinker, and insertion of the M13 origin of 
replication in the Clal site. In some instances, the deposited clone can become "flipped" 
(i.e v in the reverse orientation) in the deposited isolate. In such instances, the cDNA insert 
can still be isolated by digestion with EcoRI and NotI. However, NotI will then produce 
the 5* site and EcoRI will produce the 3' site for placement of the cDNA in proper 
orientation for expression in a suitable vector. The cDNA may also be expressed from the 
vectors in which they were deposited. 

Bacterial cells containing a particular clone can be obtained from the composite 
deposit as follows: 

An oligonucleotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The sequence of an 
oligonucleotide probe that was used to isolate or to sequence each full-length clone is 
identified below, and should be most reliable in isolating the clone of interest. 
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Clone 


Probe Seauence 


cc359_4 


SEQ ID NO:21 


ct547^2 


SEQIDNO:22 


en553_l 


SEQ ID NO:23 


nn296^2 


SEO ID KTO-24 


nq27J3 


SEQ ID NO:25 


pk65_4 


SEQ ID NO:26 


pk855_l 


SEQ ID NO:27 


PL776_6 


SEQ ID NO:28 


pm4_13 


SEQ ID NO:29 


pt326_4 


SEQ ID NO:30 



In the sequences listed above which include an N at position 2, that position is occupied 
in preferred probes/primers by a biotinylated phosphoaramidite residue rather than a 
nucleotide (such as, for example, that produced by use of biotin phosphoramidite (1- 
dimethoxytrityloxy-2-(N-biotm^^ 

diisopropyl)-phosphoramadite) (Glen Research, cat. no. 10-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("NTs"), if any; 

(b) It should be designed to have a T m of approx. 80 ° C (assuming 2° for each 
A or T and 4 degrees for each G or C). 

The oligonucleotide should preferably be labeled with y- 32 P ATP (specific activity 6000 
Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for 
labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 
label should preferably be removed by gel filtration chromatography or other established 
methods. The amount of radioactivity incorporated into the probe should be quantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
probe should be approximately 4e+§ dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably 
be thawed and 100 ul of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampicillin at 100 ug/ml. The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in 
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fresh L-broth. Aliquots of these dilutions should preferably be plated to determine the 
dilution and volume which will yield approximately 5000 distinct and well-separated 
colonies on solid bacteriological media containing L-broth containing ampicillin at 100 
ug/ml and agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C Other 
5 known methods of obtaining distinct, well-separated colonies can also be employed. 

Standard colony hybridization procedures should then be used to transfer the 
colonies to nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 
6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate /liter, adjusted to pH 7.0 with 
1 0 MaOH) containing 0.5% SDS, 100 ug/ml of yeast RNA, and 10 rruM EDTA (approximately 
10 mL per 150 mm filter). Preferably, the probe is then added to the hybridization mix at 
a concentration greater than or equal to le+6 dpm/mL. The filter is then preferably 
incubated at 65°C with gentle agitation overnight. The filter is then preferably washed in 
500 mL of 2X SSC/0.5% SDS at room temperature without agitation, preferably followed 
15 by 500 mL of 2X SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. 
A third wash with 0.1X SSC/0.5% SDS at 65°C for 30 minutes to 1 hour is optional. The 
filter is then preferably dried and subjected to autoradiography for sufficient time to 
visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

20 The positive colonies are picked, grown in culture, and plasmid DNA isolated 

using standard procedures. The clones can then, be verified by restriction analysis, 
hybridization analysis, or DNA sequencing. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 

2 5 protein may be in linear form or they may be cyclized using known methods, for example, 

as described in H.U. Saragovi, et al, Bio/Technology 10, 773-77S (1992) and in R.S. 
McDowell, et al,]. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated 
herein by reference. Such fragments may be fused to carrier molecules such as 
immunoglobulins for many purposes, including increasing the valency of protein binding 

3 0 sites. For example, fragments of the protein may be fused through linker*' sequences to 

the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 
could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 
be used to generate such fusions. For example, a protein - IgM fusion would generate a 
decavalent form of the protein of the invention. 
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The present invention also provides both full-length and mature forms of the 
disclosed proteins. The full-length form of the such proteins is identified in the sequence 
listing by translation of the nucleotide sequence of each disclosed clone. The mature 
form(s) of such protein may be obtained by expression of the disclosed full-length 
polynucleotide (preferably those deposited with ATCC) in a suitable mammalian cell or 
other host cell. The sequence(s) of the mature form(s) of the protein may also be 
determinable from the amino acid sequence of the full-length form. 

The present invention also provides genes corresponding to the polynucleotide 
sequences disclosed herein. "Corresponding genes'' are the regions of the genome that 
are transcribed to produce the mKNAs from which cDNA polynucleotide sequences are 
derived and may include contiguous regions of the genome necessary for the regulated 
expression of such genes. Corresponding genes may therefore include but are not limited 
to coding sequences, 5* and 3' untranslated regions, alternatively spliced exons, introns, 
promoters, enhancers, and silencer or suppressor elements. The corresponding genes can 
be isolated in accordance with known methods using the sequence information disclosed 
herein. Such methods include the preparation of probes or primers from the disclosed 
sequence information for identification and/or amplification of genes in appropriate 
genomic libraries or other sources of genomic materials. An "isolated gene" is a gene that 
has been separated from the adjacent coding sequences, if any, present in the genome of 
the organism from which the gene was isolated. 

The chromosomal location corresponding to the polynucleotide sequences 
disclosed herein may also be determined, for example by hybridizing appropriately 
labeled polynucleotides of the present invention to chromosomes in situ. It may also be 
possible to determine the corresponding chromosomal location for a disclosed 
polynucleotide by identifying significantly similar nucleotide sequences in public 
databases, such as expressed sequence tags (ESTs), that have already been mapped to 
particular chromosomal locations. For at least some of the polynucleotide sequences 
disclosed herein, public database sequences having at least some similarity to the 
polynucleotide of the present invention have been listed by database accession number. 
Searches using the GenBank accessiQn numbers of these public database sequences can 
then be performed at an Internet site provided by the National Center for Biotechnology 
Information having the address http://www.ncbi.nlm.nih.gov/UniGene/, in order to 
identify "UniGene clusters" of overlapping sequences. Many of the "UniGene clusters" 
so identified will already have been mapped to particular chromosomal sites. 
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Organisms that have enhanced, reduced, or modified expression of the gene(s) 
corresponding to the polynucleotide sequences disclosed herein are provided. The 
desired change in gene expression can be achieved through the use of antisense 
polynucleotides or ribozymes that bind and /or cleave the mRNA transcribed from the 
5 gene (Albert and Morris, 1994, Trends Pharmacol Set 15(7): 250-254; Lavarosky et al, 1997, 
Biochem. Mol Med. 62(1): 11-22; and Hampel, 1998, Prog. Nucleic Acid Res. Mol Biol 58: 1- 
39; all of which are incorporated by reference herein). Transgenic animals that have 
multiple copies of the gene(s) corresponding to the polynucleotide sequences disclosed 
herein, preferably produced by transformation of cells with genetic constructs that are 

10 stably maintained within the transformed cells and their progeny, are provided. 
Transgenic animals that have modified genetic control regions that increase or reduce 
gene expression levels, or that change temporal or spatial patterns of gene expression, are 
also provided (see European Patent No. 0 649 464 Bl, incorporated by reference herein). 
In addition, organisms are provided in which the gene(s) corresponding to the 

1 5 polynucleotide sequences disclosed herein have been partially or completely inactivated, 
through insertion of extraneous sequences into the corresponding gene(s) or through 
deletion of all or part of the corresponding gene(s). Partial or complete gene inactivation 
can be accomplished through insertion, preferably followed by imprecise excision, of 
transposable elements (Plasterk, 1992, Bioessays 14(9): 629-633; Zwaal et al, 1993, Proc. Natl 

2 0 Acad. Sci. USA 90(16): 7431-7435; Clark et al, 1994, Proc. Natl Acad. Scl USA 91(2): 719-722; 
all of which are incorporated by reference herein), or through homologous recombination, 
preferably detected by positive /negative genetic selection strategies (Mansour et al, 1988, 
Nature 336: 348-352; U.S. Patent Nos. 5,464,764; 5,487,992; 5,627,059; 5,631,153; 5,614, 396; 
5,616,491; and 5,679,523; all of which are incorporated by reference herein). These 

2 5 organisms with altered gene expression are preferably eukaryotes and more preferably 

are mammals. Such organisms are useful for the development of non-human models for 
the study of disorders involving the corresponding gene(s), and for the development of 
assay systems for the identification of molecules that interact with the protein product(s) 
of the corresponding gene(s). 

3 0 Where the protein of the present invention is membrane-bound (e.g., is a receptor), 

the present invention also provides for soluble forms of such protein. In such forms, part 
or all of the intracellular and transmembrane domains of the protein are deleted such that 
the protein is fully secreted from the cell in which it is expressed. The intracellular and 
transmembrane domains of proteins of the invention can be identified in accordance with 
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known techniques for determination of such domains from sequence information. For 
example, the TopPredll computer program can be used to predict the location of 
transmembrane domains in an amino acid sequence, domains which are described by the 
location of the center of the rransmsmbrane domain, with at least ten transmembrane 
amino acids on each side of the reported central residue(s). 

Proteins and protein fragments of the present invention include proteins with 
amino acid sequence lengths that are at least 25%(more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 
identity) with that disclosed protein, where sequence identity is determined by comparing 
the amino acid sequences of the proteins when aligned so as to maximize overlap and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 
(more preferably 20 or more, most preferably 30 or more) contiguous amino acids that 
shares at least 75% sequence identity (more preferably, at least 85% identity; most 
preferably at least 95% identity) with any such segment of any of the disclosed proteins. 

In particular, sequence identity may be determined using WU-BLAST 
(Washington University BLAST) version 2.0 software, which builds upon WU-BLAST 
version 1.4, which in turn is based on the public domain NCBI-BLAST version 1.4 
(Altschul and Gish, 1996, Local alignment statistics, Doolittle ed. 9 Methods in Enzymology 
266: 460-480; Altschul et al, 1990, Basic local alignment search tool, Journal of 
Molecular Biology 215: 403-410; Gish and States, 1993, Identification of protein coding 
regions by database similarity search, Nature Genetics 3: 266-272; Karlin and Altschul, 
1993, Applications and statistics for multiple high-scoring segments in molecular 
sequences, Proa Natl Acad. ScL USA 90: 5873-5877; all of which are incorporated by 
reference herein). WU-BLAST version 2.0 executable programs for several UNIX 
platforms can be downloaded from ftp://blast.wustl.edu/blast/executables. The complete 
suite of search programs (BLASTP, BLASTN, BLASTX, TBLASTN, and TBLASTX) is 
provided at that site, in addition tp several support programs. WU-BLAST 2.0 is 
copyrighted and may not be sold or redistributed in any form or manner without the 
express written consent of the author; but the posted executables may otherwise be freely 
used for commercial, nonprofit, or academic purposes. In all search programs in the suite 
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- BLASTP, BLASTN, BLASTX, TBLASTN and TBLASTX - the gapped alignment 
routines are integral to the database search itself, and thus yield much better sensitivity and 
selectivity while producing the more easily interpreted output. Gapping can optionally be 
turned off in all of these programs, if desired. The default penalty (Q) for a gap of length 
one is Q=9 for proteins and BLASTP, and Q=10 for BLASTN, but may be changed to any 
integer value including zero, one through eight, nine, ten, eleven, twelve through twenty, 
twenty-one through fifty, fifty-one through one hundred, etc. The default per-residue 
penalty for extending a gap (R) is R=2 for proteins and BLASTP, and R=10 for BLASTN, 
but may be changed to any integer value including zero, one, two, three, four, five, six, 
seven, eight, nine, ten, eleven, twelve through twenty, twenty-one through fifty, fifty-one 
through one hundred, etc. Any combination of values for Q and R can be used in order to 
align sequences so as to maximize overlap and identity while minimizing sequence gaps. 
The default amino acid comparison matrix is BLOSUM62, but other amino acid 
comparison matrices such as PAM can be utilized. 

Species homologues of the disclosed polynucleotides and proteins are also 
provided by the present invention. As used herein, a "species homologue" is a protein or 
polynucleotide with a different species of origin from that of a given protein or 
polynucleotide, but with significant sequence similarity to the given protein or 
polynucleotide. Preferably, polynucleotide species homologues have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% identity) with 
the given polynucleotide, and protein species homologues have at least 30% sequence 
identity (more preferably, at least 45% identity; most preferably at least 60% identity) with 
the given protein, where sequence identity is determined by comparing the nucleotide 
sequences of the polynucleotides or the amino acid sequences of the proteins when 
aligned so as to maximize overlap and identity while minimizing sequence gaps. Species 
homologues may be isolated and identified by making suitable probes or primers from 
the sequences provided herein and screening a suitable nucleic acid source from the 
desired species. Preferably, species homologues are those isolated from mammalian 
species. Most preferably, species homologues are those isolated from certain mammalian 
species such as, for example, Pan troglodytes, Gorilla gorilla, Pongo pygmaeus, Hylobates 
concolor, Macaca mulatta, Papio papio, Papio hamadryas, Cercopithecus aethiops, Cebus capucinus, 
Aotus trivirgatus, Sanguinus oedipus, Microcebus murinus, Mus musculus, Rattus norvegicus, 
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Cricetulus griseus, Felis catus, Mustela vison, Canis familiaris, Oryctolagus cuniculus, Bos taurus, 
Ovis aries, Sus scrofa, and Equus caballus, for which genetic maps have been created 
allowing the identification of syntenic relationships between the genomic organization of 
genes in one species and the genomic organization of the related genes in another species 
(O'Brien and Seuanez, 1988, Ann. Rev. Genet 22: 323-351; O'Brien et ah, 1993, Nature 
Genetics 3:103-112; Johansson et al, 1995, Genomics 25: 682-690; Lyons et ah, 1997, Nature 
Genetics 15: 47-56; O'Brien et al, 1997, Trends in Genetics 13(10): 393-399; Carver and Stubbs, 
1997, Genome Research 7:1123-1137; all of which are incorporated by reference herein). 

The invention also encompasses allelic variants of the disclosed polynucleotides 
or proteins; that is, naturaHy-occurring alternative forms of the isolated polynucleotides 
which also encode proteins which are identical or have significantly similar sequences to 
those encoded by the disclosed polynucleotides. Preferably, allelic variants have at least 
60% sequence identity (more preferably, at least 75% identity; most preferably at least 90% 
identity) with the given polynucleotide, where sequence identity is determined by 
comparing the nucleotide sequences of the polynucleotides when aligned so as to maximize 
overlap and identity while minimizing sequence gaps. Allelic variants may be isolated and 
identified by making suitable probes or primers from the sequences provided herein and 
screening a suitable nucleic acid source from individuals of the appropriate species. 

The invention also includes polynucleotides with sequences complementary to 
those of the polynucleotides disclosed herein. 

The present invention also includes polynucleotides that hybridize under reduced 
stringency conditions, more preferably stringent conditions, and most preferably highly 
stringent conditions, to polynucleotides described herein. Examples of stringency 
conditions are shown in the table below: highly stringent conditions are those that are at 
least as stringent as, for example, conditions A-F; stringent conditions are at least as 
stringent as, for example, conditions G-L; and reduced stringency conditions are at least 
as stringent as, for example, conditions M-R. 
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5 



10 



Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
Length 
(bp)* 


Hybridization Temperature and 
Buffer' 


Wash 

1 CI 1 lU tl u L Ul C. 

and Buffer' 


A 


DNA:DNA 


* 50 


65°C; lxSSC -or- 

42°C; lxSSC, 50% formamide 


UJ V— / U.J AjJ «-? 


B 


DNA:DNA 


<50 


Tg ; lxSSC 


V; lxSSC 


C 


DNA:RNA 


s 50 


67°C; lxSSC -or- 

45°C; lxSSC, 50% formamide 




D 


DNA:RNA 


<50 


T D *; lxSSC 


V; lxSSC 


E 


RNA:RNA 


;> 50 


50°C; lxSSC, 50% formamide 


/U i_, Uoxisbt- 


F 


RNA:RNA 


<50 


T F *; lxSSC 


T F »; lxSSC 


G 


DNA:DNA 


* 50 


oj v—, 'ixr>cH_ -or- 

42°C; 4xSSC, 50% formamide 


65 C; lxbbC 


H 


DNA:DNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNA-RNA 


^ 50 


o/ v^, £ ixozA^ -or- 

45°C; 4xSSC, 50% formamide 


67° C; lxSSC 


J 


DNA:RNA 


<50 


T,*;4xSSC 


T*;4xSSC 


K 


RNA-RNA 


* 50 


/ v v^, i ±xj>j\-. -or- 

50°C; 4xSSC, 50% formamide 


67 C; lxSSC 


L 


RNA:RNA 


<50 


T L *;2xSSC 


T L *;2xSSC 


M 


DNA:DNA 


>. 50 


40°C; 6xSSC, 50% formamide 


oU C;2xbSC 


N 


DNA:DNA 


<50 


T N *; 6xSSC 


T N *; 6xSSC 


O 


DNArRNA 


;> 50 


55°C;4xSSC-or- 

42°C; 6XSSC, 50% formamide 


55°C; 2xSSC 


P 


DNA:RNA 


<50 


V; 6xSSC 


V; 6xSSC 


Q 


RNA:RNA 


* 50 


60°C; 4xSSC-or- 

45°C; 6xSSC, 50% formamide 


60°C; 2xSSC 


R 


RNArRNA 


<50 


T R *; 4xSSC 


T R *;4xSSC 



*: The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When 
hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed 
to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the 

2 5 hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 

or regions of optimal sequence complementarity. 

*: SSPE (lxSSPE is 0.15M NaCl, lOmM NaH 2 PC\, and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
(lxSSC is 0.15M NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 

3 0 *T B - T R : The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should 

be 5-10°C less than the melting temperature (T m ) of the hybrid, where T m is determined according to the 
following equations. For hybrids less than 18 base pairs in length, T m (°C) = 2(# of A + T bases) + 4(# ofG-h 
C bases). For hybrids between 18 and 49 base pairs in length, T m (°C) = 81.5 + 16.6(log 10 [Na + ]) + 0.41(%G+C) - 
(600/N), where N is the number of bases in the hybrid, and [Na + ] is the concentration of sodium ions in the 
3 5 hybridization buffer ([Na + ] for lxSSC = 0.165 M). 
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Additional examples of stringency conditions for polynucleotide hybridization are 
provided in Sambrook, J., E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F.M. Ausubel et al., eds., 
5 Jo?: *n vViley &c Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

Preferably, each such hybridizing polynucleotide has a length that is at least 
25%(more preferably at least 50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
10 95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence identity is determined by comparing the sequences of the hybridizing 
polynucleotides when aligned so as to maximize overlap and identity while minimizing 
sequence gaps. 

The isolated polynucleotide of the invention may be operably linked to an 
15 expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. General 
methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably 
2 0 linked" means that the isolated polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way that the protein is expressed 
by a host cell which has been transformed (transfected) with the ligated 
polynucleotide/expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 

2 5 protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 

Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 
cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

3 0 Alternatively, it may be possible to produce the protein in lower eukaryotes such 

as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
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strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covaient attachments may be accomplished using known chemical or 
5 enzymatic methods. 

The protein may also be produced by operably linking the isolated polynucleotide 
of the invention to suitable control sequences in one or more insect expression vectors, 
and employing an insect expression system. Materials and methods for 
baculo virus/ insect cell expression systems are commercially available in kit form from, 
1 0 e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

1 5 The protein of the invention may be prepared by culturing transformed host cells 

under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 

2 0 containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 3GA 
Sepharose®; one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chromatography. 

2 5 Alternatively, the protein of the invention may also be expressed in a form which 

will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP), glutathione-S^transferase (GST) or thioredoxin 
(TRX). Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLabs (Beverly, MA), Pharmacia (Piscataway, NJ) and 

3 0 Invitrogen Corporation (Carlsbad, CA), respectively. The protein can also be tagged with 

an epitope and subsequently purified by using a specific antibody directed to such 
epitope. One such epitope ("Flag") is commercially available from the Eastman Kodak 
Company (New Haven, CT). 
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Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
all of the foregoing purification steps, in various combinations, can also be employed to 
5 provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
with the present invention as an "isolated protein/' 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
1 0 are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

The protein may also be produced by known conventional chemical synthesis. 
Methods for constructing the proteins of the present invention by synthetic means are 
known to those skilled in the art. The synthetically-constructed protein sequences, by 
15 virtue of sharing primary, secondary or tertiary structural and /or conformational 
characteristics with proteins may possess biological properties in common therewith, 
including protein activity. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the development of antibodies. 

2 0 The proteins provided herein also include proteins characterized by amino acid 

sequences similar to those of purified proteins but into which modification are naturally 
provided or deliberately engineered. For example, modifications in the peptide or DNA 
sequences can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 
25 insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 
amino acid to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., U.S. Patent No. 4,518,584). Preferably, such alteration, substitution, replacement, 

3 0 insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and may thus be useful for screening 
or other immunological methodologies may also be easily made by those skilled in the art 
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given the disclosures herein. Such modifications are believed to be encompassed by the 
present invention. 

USES AND BIOLOGICAL ACTIVITY 

5 The polynucleotides and proteins of the present invention are expected to exhibit 

one or more of the uses or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
invention may be provided by adrrunistration or use of such proteins or by administration 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 
10 or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express 
15 recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constiturively or at a particular stage of tissue differentiation or development or in disease 
states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 

2 0 with endogenous DNA sequences in patients to identify potential genetic disorders; as 

probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic rnngerprmting; as a probe to "subtract-out" 
known sequences in the process of discovering other novel polynucleotides; for selecting 
and making oligomers for attachment to a "gene chip" or other support, including for 
25 examination of expression patterns; to raise anti-protein antibodies using DNA 
immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
polynucleotide can also be used in interaction trap assays (such as, for example, those 

3 0 described in Gyuris et al, 1993, Cell 75: 791-803 and in Rossi et al, 1997, Proc. Natl Acad. 

Set USA 94: 8405-8410, all of which are incorporated by reference herein) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. 
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The proteins provided by the present invention can similarly be used in assay to 
determine biological activity, including in a panel of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit another immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

Any or all of these research utilities are capable of being developed into reagent 
grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed v Cold Spring Harbor Laboratory Press, Sambrook, 
J., E.F. Fritsch and T. Martians eds., 1989, and "Methods in Enzymology: Guide to 
Molecular Cloning Techniques", Academic Press, Berger, SX. and A.R. Kimmel eds., 1987. 

Nutritional Usps 

Polynucleotides and proteirts of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein 
or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a 
source of carbohydrate. In such cases the protein or polynucleotide of the invention can 
be added to the feed of a particular organism or can be administered as a separate solid 
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 
capsules. In the case of microorganisms, the protein or polynucleotide of the invention 
can be added to the medium in or or\ which the microorganism is cultured. 

Cyt okine and Cell Proliferation /Differentiation Activity 

A protein of the present invention may exhibit cytokine, cell proliferation (either 
inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may 
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induce production of other cytokines in certain cell populations. Many protein factors 
discovered to date, including all known cytokines, have exhibited activity in one or more 
factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
confirmation of cytokine activity. The activity of a protein of the present invention is 
5 evidenced by any one of a number of routine factor dependent cell proliferation assays 
for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, 
MC9/G, M+ (preB M+), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMiC 
The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

10 Assays for T-cell or thymocyte proliferation include without limitation those 

described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, EM. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
7, Immunologic studies in Humans); Takai et aL, J. Immunol. 137:3494-3500, 1986; 

1 5 Bertagnolli et aL, J. Irnmunol. 145:1706-1712, 1990; Bertagnolli et aL, Cellular Irnmunology 
133:327-341, 1991; Bertagnolli, et aL, J. Immunol. 149:3778-3783, 1992; Bowman et aL, J. 
ImmunoL 152: 1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node 
cells or thymocytes include, without limitation, those described in: Polyclonal T cell 

2 0 stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in Immunology. J.E.e.a. 
Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
Measurement of mouse and human Interferon y, Schreiber, R.D. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 
Assays for proliferation and differentiation of hematopoietic and lymphopoietic 

2 5 cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K, Davis, L.S. and Lipsky, P.E. In Current 
Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, 
Toronto. 1991; deVries et aL, J. Exp. Med. 173:1205-1211, 1991; Moreau et aL, Nature 
336:690-692, 1988; Greenberger et aL, Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; 

30 Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 
Smith et aL, Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 
Interleukin 11 - Bennett, F., Giarmotti, J., Clark, S.C. and Turner, K. J. In Current Protocols 
in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
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Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S.C 
and Turner, K.J. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, 
5 proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies, 
E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and 
1 0 their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et ah, 
Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al, Eur. J. Immun. 
11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 
140:508-512, 1988. 

15 Immune Stimulating or Suppressing Activity 

A protein of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays 
are described herein. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., 

20 in regulating (up or down) growth and proliferation of T and /or B lymphocytes, as well 
as effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal 
infections, or may result from autoimmune disorders. More specifically, infectious 
diseases causes by viral, bacterial, fungal or other infection may be treatable using a 

2 5 protein of the present invention, including infections by HIV, hepatitis viruses, 

herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections 
such as candidiasis. Of course, in this regard, a protein of the present invention may also 
be useful where a boost to the immune system generally may be desirable, i.e., in the 
treatment of cancer. 

3 0 Autoimmune disorders which may be treated using a protein of the present 

invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 
Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. 
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Such a protein of the present invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
example, organ transplantation), may also be treatable using a protein of the present 
5 invention. 

Using the proteins of the invention it may also be possible to regulate immune 
responses in a number of ways. Down regulation may be in the form of inhibiting or 
blocking an immune response already in progress or may involve preventing the 
induction of an immune response. The functions of activated T cells may be inhibited by 

10 suppressing T cell responses or by inducing specific tolerance in T cells, or both. 
Immunosuppression of T cell responses is generally an active, non-antigen-specific, 
process which requires continuous exposure of the T cells to the suppressive agent. 
Tolerance, which involves inducing non-responsiveness or anergy in T cells, is 
distinguishable from immunosuppression in that it is generally antigen-specific and 

1 5 persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the 
absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as , for example, B7)), e.g., preventing 

2 0 high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft-versus-host disease (GVHD). For example, 
blockage of T cell function should result in reduced tissue destruction in tissue 
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that destroys 

2 5 the transplant. The administration of a molecule which inhibits or blocks interaction of 

a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, 
monomelic form of a peptide having B7-2 activity alone or in conjunction with a 
monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g., B7- 
1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the 

3 0 molecule to the natural ligand(s) on the immune cells without transmitting the 

corresponding costimulatory signal. ; Blocking B lymphocyte antigen function in this 
matter prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an 
immunosuppressant. Moreover, the lack of costimulation may also be sufficient to 
anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 
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tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
tolerance in a subject, it may also be necessary to block the function of a combination of 
B lymphocyte antigens. 

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al, Science 257:789-792 (1992) and Turka et al, Proc. Natl. Acad. 
Sci USA, 59:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 
Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of blocking B lymphocyte antigen function in vivo on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
activation of T cells that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T cells may reduce or eliminate disease symptoms. 
Administration of reagents which block costimulation of T cells by disrupting 
receptor:ligand interactions of B lymphocyte antigens can be used to inhibit T cell 
activation and prevent production of autoantibodies or T cell-derived cytokines which 
may be involved in the disease process. Additionally, blocking reagents may induce 
antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from 
the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune 
disorders can be determined using a number of well-characterized animal models of 
human autoimmune diseases. Examples include murine experimental autoimmune 
encephalitis, systemic lupus erythmatosis in MRL/lprApr mice or NZB hybrid mice, 
murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven 
Press, New York, 1989, pp. 840-856), ■ 

Upregulation of an antigen function (preferably a B lymphocyte antigen function), 
as a means of up regulating immune responses, may also be useful in therapy. 
Upregulation of immune responses may be in the form of enhancing an existing immune 
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response or eliciting an initial immune response. For example, enhancing an immune 
response through stimulating B lymphocyte antigen function may be useful in cases of 
viral infection. In addition, systemic viral diseases such as influenza, the common cold, 
and encephalitis might be alleviated by the administration of stimulatory forms of B 
5 lymphocyte antigens systemically. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
by removing T cells from the patient, costimulating the T cells in vitro with viral antigen- 
pulsed APCs either expressing a peptide of the present invention or together with a 
stimulatory form of a soluble peptide of the present invention and reintroducing the in 

1 0 vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses would be to isolate infected cells from a patient, transfect them with a nucleic 
acid encoding a protein of the present invention as described herein such that the cells 
express all or a portion of the protein on their surface, and reintroduce the transfected 
cells into the patient The infected cells would now be capable of delivering a 

15 costimulatory signal to, and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of antigen function 
(preferably B lymphocyte antigen function) may be useful in the induction of tumor 
immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, 
carcinoma) transfected with a nucleic acid encoding at least one peptide of the present 

20 invention can be administered to a subject to overcome tumor-specific tolerance in the 
subject. If desired, the tumor cell can be transfected to express a combination of peptides. 
For example, tumor cells obtained from a patient can be transfected ex vivo with an 
expression vector directing the expression of a peptide having B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity and /or B7-3-like activity. The 

2 5 transfected tumor cells are returned to the patient to result in expression of the peptides 

on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 
to target a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention having the activity of a B 
lymphocyte antigen(s) on the surface of the tumor cell provides the necessary 

3 0 costimulation signal to T cells to induce a T cell mediated immune response against the 

transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II 
molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class II 
molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion^ of an MHC class I a chain protein and P 2 
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microglobulin protein or an MHC class II a chain protein and an MHC class II p chain 
protein to thereby express MHC class I or MHC class II proteins on the cell surface. 
Expression of the appropriate class I or class II MHC in conjunction with a peptide having 
the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated 
5 immune response against the transfected tumor cell. Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class II associated protein, such 
as the invariant chain, can also be cotransf ected with a DNA encoding a peptide having 
the activity of a B lymphocyte antigen to promote presentation of tumor associated 
antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated 

1 0 immune response in a human subject may be sufficient to overcome tumor-specific 
tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 

1 5 limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. 
Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1- 
3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. 
USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., 

20 J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., 
J. Immunol. 140:508-512, 1988; Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 
1981; Herrmann et al., J. Immunol 128:1968-1974, 1982; Handa et al, J. Immunol. 
135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. 
Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., 

2 5 Cellular Immunology 133:327-341, 1991 ; Brown et al., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switcriing 
(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol 144:3028-3033, 1990; and Assays for B cell function: In vitro 

3 0 antibody production, Mond, J.J. and Brunswick, M. In Current Protocols in Immunology. 

J.E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, 
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D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai 
et ah, J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 
5 Dendritic cell-dependent assays (which will identify, among others, proteins 

expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
described in: Guery et aL, J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 
154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 1995; 

10 Nair et aL, Journal of Virology 67:4062-4069, 1993; Huang et aL, Science 264:961-965, 
1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj 
et ah, Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/ apoptosis (which will identify, among others, 

15 proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 
et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et aL, Cell 66:233-243, 1991; Zacharchuk, 
Journal of Immunology 145:4037-4045, 1990; Zamai et aL, Cytometry 14:891-897, 1993; 

2 0 Gorczyca et ah, International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 
development include, without limitation, those described in: Antica et al., Blood 
84:111-117, 1994; Fine et aL, Cellular Immunology 155:111-122, 1994; Galy et al., Blood 
85:2770-2778, 1995; Toki et aL, Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 

25 

Hematopoiesis Regulating Activity 

A protein of the present invention may be useful in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming cells or of factor-dependent cell 

3 0 lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and 

proliferation of erythroid progenitor cells alone or in combination with other cytokines, 
thereby indicating utility, for example, in treating various anemias or for use in 
conjunction with irradiation/chemotherapy to stimulate the production of erythroid 
precursors and /or erythroid cells; in supporting the growth and proliferation of myeloid 
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cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat consequent 
myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet 
5 disorders such as thrombocytopenia, and generally for use in place of or complimentary 
to platelet transfusions; and /or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell 
disorders (such as those usually treated with transplantation, including, without 

10 limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem cell compartment post irradiation /chemotherapy, either in-vivo or 
ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

15 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for proliferation and differentiation of various hematopoietic lines 
are cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
20 proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et 
al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 
81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which will identify, among 
25 others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 
Hematopoietic Cells. R.I. Freshney, et al eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, 
NY. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive 
hematopoietic colony forming cells with high proliferative potential, McNiece, I.K. and 
3 0 Briddell, R.A. In Culture of Hematopoietic Cells, R.I. Freshney, et al. eds. Vol pp. 23-39, 
Wiley-Liss, Inc., New York, NY. 1994;-Neben et al., Experimental Hematology 22:353-359, 
1994; Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematopoietic 
Cells. RJL Freshney, et al eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long 
term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and 
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Allen, T\ In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 163-179, 
Wiley-Liss, Inc., New York, NY. 1994; Long term culture initiating cell assay, Sutherland, 
HJ. In Culture of Hematopoietic Cells. R.I. Freshney, et al eds. Vol pp. 139-162, Wiley-liss, 
Inc., New York, NY. 1994. 

5 

Tissue Growth Activity 

A protein of the present invention also may have utility in compositions used for 
bone, cartilage, tendon, ligament and /or nerve tissue growth or regeneration, as well as 
for wound healing and tissue repair and replacement, and in the treatment of burns, 
1 0 incisions and ulcers. 

A protein of the present invention, which induces cartilage and/or bone growth 
in circumstances where bone is not normally formed, has application in the healing of 
bone fractures and cartilage damage or defects in humans and other animals. Such a 
preparation employing a protein of the invention may have prophylactic use in closed as 
15 well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of 
congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 

2 0 disease, and in other tooth repair processes. Such agents may provide an environment 

to attract bone-forming cells, stimulate growth of bone-forming cells or induce 
differentiation of progenitors of bone-forming cells. A protein of the invention may also 
be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation 
of bone and/or cartilage repair or by blocking inflammation or processes of tissue 
25 destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory 
processes. 

Another category of tissue regeneration activity that may be attributable to the 
protein of the present invention is tendon/ ligament formation. A protein of the present 
invention, which induces tendon /ligament-like tissue or other tissue formation in 

3 0 circumstances where such tissue is not normally formed, has application in the healing of 

tendon or ligament tears, deformities and other tendon or ligament defects in humans and 
other animals. Such a preparation employing a tendon/ ligament-like tissue inducing 
protein may have prophylactic use in preventing damage to tendon or ligament tissue, as 
well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
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in repairing defects to tendon or ligament tissue. De novo tendon/ ligament-like tissue 
formation induced by a composition of the present invention contributes to the repair of 
congenital trauma induced, or other tendon or ligament defects of other origin, and is also 
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The 
5 compositions of the present invention may provide an environment to attract tendon- or 
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce 
differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of 
tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal 
1 0 tunnel syndrome and other tendon or ligament defects. The compositions may also 
include an appropriate matrix and/or sequestering agent as a carrier as is well known in 
the art. 

The protein of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 
15 peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a protein may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's/ 

2 0 Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 

syndrome. Further conditions which may be treated in accordance with the present 
invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 
25 invention. 

Proteins of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and the like. 

It is expected that a protein of the present invention may also exhibit activity for 

3 0 generation or regeneration of other tissues, such as organs (including, for example, 

pancreas, liver, intestine, kidney, skin; endothelium), muscle (smooth, skeletal or cardiac) 
and vascular (including vascular endothelium) tissue, or for promoting the growth of cells 
comprising such tissues. Part of the desired effects may be by inhibition or modulation 
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of fibrotic scarring to allow normal tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 

A protein of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, 
5 and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting 
the growth of tissues described above. 

The activity of a protein of the invention may, among other means, be measured 
10 by the following methods: 

Assays for tissue generation activity include, without limitation, those described 
in: International Patent Publication No. W095/ 16035 (bone, cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. WO91/07491 (skin, endothelium ). 
1 5 Assays for wound healing activity include, without limitation, those described in: 

Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, HI and Rovee, DT, eds.), Year 
Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

2 0 Activin/Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin-related 
activities. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present 

2 5 invention, alone or in heterodimers with a member of the inhibin a family, may be useful 

as a contraceptive based on the ability of inhibins to decrease fertility in female mammals 
and decrease spermatogenesis in male mammals. Administration of sufficient amounts 
of other inhibins can induce infertility in these mammals. Alternatively, the protein of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin- 

3 0 P group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 

molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 
United States Patent 4,798,885. A protein of the invention may also be useful for 
advancement of the onset of fertility in sexually immature mammals, so as to increase the 
lifetime reproductive performance of domestic animals such as cows, sheep and pigs. 
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The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91:562-572, 1972; Ling et aL, Nature 321:779-782, 1986; Vale et 
5 al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. 
Natl. Acad. Sci. USA 83:3091-3095, 1986. 

Chemotactic/Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic activity 
10 (e.g., act as a chemokine) for mammalian cells, including, for example, monocytes, 

fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and /or endothelial cells. 

Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell 

population to a desired site of action. Chemotactic or chemokinetic proteins provide 

particular advantages in treatment of wounds and other trauma to tissues, as well as in 
15 treatment of localized infections. For example, attraction of lymphocytes, monocytes or 

neutrophils to tumors or sites of infection may result in improved immune responses 

against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 

can stimulate, directly or indirectly, the directed orientation or movement of such cell 

2 0 population. Preferably, the protein or peptide has the ability to directly stimulate directed 

movement of cells. Whether a particular protein has chemotactic activity for a population 
of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be measured 
25 by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration 
of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell population. Suitable assays for movement and adhesion 

3 0 include, without limitation, those described in: Current Protocols in Immunology, Ed by 

J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and 
beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
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APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153: 1762-1768, 1994. 

Hemostatic and Thrombolytic Activity 
5 A protein of the invention may also exhibit hemostatic or thrombolytic activity. 

As a result, such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
and other hemostatic events in treating wounds resulting from trauma, surgery or other 
causes. A protein of the invention may also be useful for dissolving or inhibiting 
10 formation of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke). 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 
15 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 

20 Receptor/Ligand Activity 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor /ligand interactions. Examples of 
such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 

2 5 receptors involved in cell-cell interactions and their ligands (including without limitation, 

cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 
development of cellular and humoral immune responses). Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 

3 0 receptor/ligand interaction. A protein of the present invention (including, without 

limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 
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Suitable assays for receptor-ligand activity include without limitation those 
described in:Current Protocols in Immunology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W.Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
5 conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
Biereretal., J. Exp. Med. 168:1145-1156, 1988; Rosenstein etaL, J. Exp. Med. 169:149-160 
1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 
1995. 

10 Anti-Inflammatory Activity 

Proteins of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 
the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 

15 inflammatory process, inhibiting or promoting ceil extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities can be used to treat 
inflammatory conditions including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 

2 0 inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 

lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 
from over production of cytokines such as TNTF or IL-1 . Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

25 

Cadherin /Tumor Invasion Suppressor Activity 

Cadherins are calcium-dependent adhesion molecules that appear to play major 
roles during development, particularly in defining specific cell types. Loss or alteration 
of normal cadherin expression can lead to changes in cell adhesion properties linked to 

3 0 tumor growth and metastasis. Cadherin malfunction is also implicated in other human 

diseases, such as pemphigus vulgaris 'and pemphigus f oliaceus (auto-immune blistering 
skin diseases), Crohn's disease, and some developmental abnormalities. 

The cadherin superfamily includes well over forty members, each with a distinct 
pattern of expression. All members of the superfamily have in common conserved 
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extracellular repeats (cadherin domains), but structural differences are found in other 
parts of the molecule. The cadherin domains bind calcium to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino acids in the 
first cadherin domain provide the basis for homophilic adhesion; modification of this 
5 recognition site can change the specificity of a cadherin so that instead of recognizing only 
itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophilic adhesion with other cadherins. 

E-cadherin, one member of the cadherin superf amily, is expressed in epithelial cell 
types. Pathologically, if E-cadherin expression is lost in a tumor, the malignant cells 

1 0 become invasive and the cancer metastasizes. Transfection of cancer cell lines with 
polynucleotides expressing E-cadherin has reversed cancer-associated changes by 
renaming altered cell shapes to normal, restoring cells' adhesiveness to each other and to 
their substrate, decreasing the cell growth rate, and drastically reducing anchorage- 
independent cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 

15 to a less advanced stage. It is likely that other cadherins have the same invasion 
suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 
encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 

2 0 in these cells by providing normal cadherin expression. 

Cancer cells have also been shown to express cadherins of a different tissue type 
than their origin, thus allowing these cells to invade and metastasize in a different tissue 
in the body. Proteins of the present invention with cadherin activity, and polynucleotides 
of the present invention encoding such proteins, can be substituted in these cells for the 
25 inappropriately expressed cadherins, restoring normal cell adhesive properties and 
reducing or eHnxinating the tendency of the cells to metastasize. 

Additionally, proteins of the present invention with cadherin activity, and 
polynucleotides of the present invention encoding such proteins, can used to generate 
antibodies recognizing and binding to cadherins. Such antibodies can be used to block 

3 0 the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 

fomiing a tumor elsewhere. Such an anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 
cancer, the less cadherin expression there will be, and this decrease in cadherin expression 
can be detected by the use of a cadherin-binding antibody. 
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Fragments of proteins of the present invention with cadherin activity, preferably 
a polypeptide comprising a decapeptide of the cadherin recognition site, and poly- 
nucleotides of the present invention encoding such protein fragments, can also be used 
to block cadherin function by binding to cadherins and preventing them from binding in 
5 ways that produce undesirable effects. Additionally, fragments of proteins of the present 
invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
encoding such protein fragments, can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
10 limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 
Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 

Tumor Inhibition Activity 

In addition to the activities described above for immunological treatment or 
1 5 prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
antibody-dependent cell-mediated cytotoxicity (ADCC)). A protein may exhibit its tumor 
inhibitory activity by acting on tumor tissue or tumor precursor tissue, by inhibiting 
formation of tissues necessary to support tumor growth (such as, for example, by 

2 0 inhibiting angiogenesis), by causing production of other factors, agents or cell types which 

inhibit tumor growth, or by suppressing, eHrninating or inhibiting factors, agents or cell 
types which promote tumor growth. 

Other Activities 

25 A protein of the invention may also exhibit one or more of the following additional 

activities or effects: inhibiting the growth, infection or function of, or killing, infectious 
agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 
(suppressing or enhancing) bodily characteristics, including, without limitation, height, 
weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 

3 0 or body part size or shape (such as ( for example, breast augmentation or diminution, 

change in bone form or shape); effecting biorhythms or caricadic cycles or rhythms; 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 
-arbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); 
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effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
in a vaccine composition to raise an immune response against such protein or another 
material or entity which is cross-reactive with such protein. 

ADMINISTRATION AND POSING 

A protein of the present invention (from whatever source derived, including 
without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutically acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL~5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, 
IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem 
cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or compliment its activity or use 
in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the invention, 
or to minimize side effects. Conversely, protein of the present invention may be included 
in formulations of the particular cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anu-thrombotic factor, or ann^inilammatory agent to irrinimize side effects 
of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent. 

A protein of the present invention may be active in multimers (e.g., heterodimers 
or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
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compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 

The pharmaceutical composition of the invention may be in the form of a complex 
of the protein(s) of present invention along with protein or peptide antigens. The protein 
and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B 
lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
including those encoded by class I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
supplied as purified MHC-peptide complexes alone or with cc^stirnulatory molecules that 
can directly signal T cells. Alternatively antibodies able to bind surface immunolgobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the 
invention. 

The pharmaceutical composition of the invention may be in the form of a liposome 
in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysoledthin, phospholipids, saponin, bile acids, 
and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in U.S. Patent No. 4,235,871; U.S. Patent No. 4,501,728; U.S. 
Patent No. 4,837,028; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference. 

As used herein, the term "therapeutically effective amount" means the total 
amount of each active component of the pharmaceutical composition or method that is 
sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, the term refers to that ingredient alone. When applied to 
a combination, the term refers to combined amounts of the active ingredients that result 
in the therapeutic effect, whether administered in combination, serially or simultaneously. 
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In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
administered in accordance with the method of the invention either alone or in 
5 combination with other therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-adrriinistered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If 
10 administered sequentially, the attending physician will decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic 
factors. 

Administration of protein of the present invention used in the pharmaceutical 
15 composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 
Intravenous adrninistration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
2 0 administered orally, protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 
an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 

2 5 When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal 

or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
may be added. The liquid form of the pharmaceutical composition may further contain 
physiological saline solution, dextrose or other saccharide solution, or glycols such as 
ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 

3 0 form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein 

of the present invention, and preferably from about 1 to 50% protein of the present 
invention. 

When a therapeutically effective amount of protein of the present invention is 
administered by intravenous, cutaneous or subcutaneous injection, protein of the present 
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invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. 
The preparation of such parenterally acceptable protein solutions, having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
5 contain, in addition to protein of the present invention, an isotonic vehicie such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 
pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 

1 0 The amount of protein of the present invention in the pharmaceutical composition 

of the present invention will depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the patient has undergone. 
Ultimately, the attending physician will decide the amount of protein of the present 
invention with which to treat each individual patient. Initially, the attending physician 

15 will administer low doses of protein of the present invention and observe the patient's 
response. Larger doses of protein of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 ug to about 100 

2 0 mg (preferably about O.lng to about 10 mg, more preferably about 0.1 ug to about 1 mg) 
of protein of the present invention per kg body weight. 

The duration of intravenous therapy using the pharmaceutical composition of the 
present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 

2 5 contemplated that the duration of each application of the protein of the present invention 

will be in the range of 12 to 24 hours of continuous intravenous administration. 
Ultimately the attending physician will decide on the appropriate duration of intravenous 
therapy using the pharmaceutical composition of the present invention. 

Protein of the invention may also be used to immunize animals to obtain 

3 0 polyclonal and monoclonal antibodies which specifically react with the protein. As used 

herein, the term "antibody" includes without limitation a polyclonal antibody, a 
monoclonal antibody, a chimeric antibody, a single-chain antibody, a CDR-grafted 
antibody, a humanized antibody, or fragments thereof which bind to the indicated protein. 
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Such term also includes any other species derived from an antibody or antibody sequence 
which is capable of binding the indicated protein. 

Antibodies to a particular protein can be produced by methods well known to those 
skilled in the art. For example, monoclonal antibodies can be produced by generation of 
5 antibody-producing hybridomas in accordance with known methods (see for example, 
Goding, 1983, Monoclonal antibodies: principles and practice, Academic Press Inc., New 
York; and Yokoyama, 1992, "Production of Monoclonal Antibodies" in Current Protocols 
in Immunology, Unit 2.5, Greene Publishing Assoc. and John Wiley & Sons). Polyclonal 
sera and antibodies can be produced by inoculation of a mammalian subject with the 

10 relevant protein or fragments thereof in accordance with known methods. Fragments of 
antibodies, receptors, or other reactive peptides can be produced from the corresponding 
antibodies by cleavage of and collection of the desired fragments in accordance with 
known methods (see for example, Goding, supra; and Andrew et al., 1992, "Fragmentation 
of Immunoglobulins " in Current Protocols in Immunology, Unit 2.8, Greene Publishing 

15 Assoc. and John Wiley & Sons). Chimeric antibodies and single chain antibodies can also 
be produced in accordance with known recombinant methods (see for example, 5,169,939, 
5,194,594, and 5,576,184). Humanized antibodies can also be made from corresponding 
murine antibodies in accordance with well known methods (see for example, U.S. Patent 
Nos. 5,530,101, 5,585,089, and 5,693,762). Additionally, human antibodies may be 

2 0 produced in non-human animals such as mice that have been genetically altered to express 

human antibody molecules (see for example Fishwild et al, 1996, Nature Biotechnology 
14: 845-851; Mendez etal, 1997 Nature Genetics 15: 146-156 (erratum Nature Genetics 
16: 410); and U.S. Patents 5,877,397 and 5,625,126). Such antibodies maybe obtained 
using either the entire protein or fragments thereof as an immunogen. The peptide 
25 immunogens additionally may contain a cysteine residue at the carboxyl tenrunus, and 
are conjugated to a hapten such as keyhole limpet hemocyanin (KLH). Methods for 
synthesizing such peptides are known in the art, for example, as in R.P. Merrifield, J. 
Amer.Chem.Soc. 85, 2149-2154 (1963); J.L. Krstenansky, et al, FEBS Lett. 21L 10 (1987). 
Monoclonal antibodies binding to the protein of the invention may be useful 

3 0 diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 

antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
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abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting 
and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

5 For compositions of the present invention which are useful for bone, cartilage, 

tendon or ligament regeneration, the therapeutic method includes admirristering the 
composition topically, systematically, or locally as an implant or device. When 
adininistered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 

10 be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
tissue damage. Topical administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention which may also 
optionally be included in the composition as described above, may alternatively or 
additionally, be adrninistered simultaneously or sequentially with the composition in the 

15 methods of the invention. Preferably for bone and/or cartilage formation, the 
composition would include a matrix capable of delivering the protem-containing 
composition to the site of bone and/or cartilage damage, providing a structure for the 
developing bone and cartilage and optimally capable of being resorbed into the body- 
Such matrices may be formed of materials presently in use for other implanted medical 

2 0 applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 
25 sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically well- 
defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins 
or extracellular matrix components. Other potential matrices are nonbiodegradable and 
chemically defined, such as sintered hydroxapatite, bioglass, aluminates, or other 

3 0 ceramics. Matrices may be comprised of combinations of any of the above mentioned 

types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in calcium- 
aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 
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Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic 
acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions 
5 from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 

1 0 carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 
acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl 
polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 0.5-20 
wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from the polymer matrix and to 

1 5 provide appropriate handling of the composition, yet not so much that the progenitor cells 
are prevented from infUtrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progenitor cells. 

In further compositions, proteins of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 

2 0 question. These agents include various growth factors such as epidermal growth factor 
(EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-a and 
TGF-fJ), and insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary 
applications. Particularly domestic animals and thoroughbred horses, in addition to 

2 5 humans, are desired patients for such treatment with proteins of the present invention. 

The dosage regimen of a protein-containing pharmaceutical composition to be 
used in tissue regeneration will be determined by the attending physician considering 
various factors which modify the action of the proteins, e.g., amount of tissue weight 
desired to be formed, the site of damage, the condition of the damaged tissue, the size of 

30 a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity 
of any infection, time of administration and other clinical factors. The dosage may vary 
with the type of matrix used in the reconstitution and with inclusion of other proteins in 
the pharmaceutical composition. For example, the addition of other known growth 
factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect 
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the dosage. Progress can be monitored by periodic assessment of tissue/bone growth 
and /or repair, for example, X-rays, his tomorphome trie determinations and tetracycline 
labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
5 polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a ceD or organism (including, without 
limitation, in the form of viral vectors or naked DMA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 
1 0 invention in order to proliferate or to produce a desired effect on or activity in such cells. 
Treated cells can then be introduced in vivo for therapeutic purposes. 

Patent and literature references cited herein are incorporated by reference as if 
fully set forth. 
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What is claimed is: 

1. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 216 to nucleotide 2174; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 339 to nucleotide 2174; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone cc359_4 deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone cc359_4 deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone cc359_4 deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone cc359_4 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:2; 

(j) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i); and 

(k) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i) / and that has a 
length that is at least 25% of the length of SEQ ID NO:L 

2. The polynucleotide of claim 1 wherein said polynucleotide is operably 
linked to at least one expression control sequence. 
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3. A host cell transformed with the polynucleotide of claim 2. 

4. The host cell of claim 3, wherein said cell is a mammalian cell. 

5. A process for producing a protein encoded by the polynucleotide of claim 
2, which process comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable culture 
medium; and 

(b) purifying said protein from the culture. 

6. A protein produced according to the process of claim 5. 

7. An isolated polynucleotide encoding the protein of claim 6. 

8. The polynucleotide of claim 7, wherein the polynucleotide comprises the 
cDNA insert of clone cc359_4 deposited under accession number ATCC 98715. 

9. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) a fragment of the amino acid sequence of SEQ ID NO:2, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:2; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
cc359_4 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

10. The protein of claim 9, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2. 

11. A composition comprising the protein of claim 9 and a pharmaceutically 
acceptable carrier. ' 

12. An isolated polynucleotide selected from the group consisting of: 
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(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 202 to nucleotide 414; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 370 to nucleotide 414; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone ct547_2 deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ct547_2 deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone ct547_2 deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone ct547_2 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:4; 

(j) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i); and 

(k) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i), and that has a 
length that is at least 25% of the length of SEQ ID NO:3. 

13. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) a fragment of the amino acid sequence of SEQ ID NO:4, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:4; and 
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(c) the amino acid sequence encoded by the cDNA insert of clone 
ct547J2 deposited under accession number ATCC 98715; 
the protein being substantially free from other mammalian proteins. 

14. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 206 to nucleotide 415; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 293 to nucleotide 415; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone en553_l deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone en553__l deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone en553_l deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone en553_l deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:6; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:6, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:6; 

(j) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i); and 

(k) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i), and that has a 
length that is at least 25% of the length of SEQ ID NO:5. 
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15. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) a fragment of the amino acid sequence of SEQ ID NO:6, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:6; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
en553_l deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

16. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 120 to nucleotide 1202; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone nn296_2 deposited under accession 
number ATCC 98715; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone nn296J2 deposited under accession number ATCC 98715; 

(e) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:8; 

(f) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:8, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:8; 

(g) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(f); and 

(h) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(f), and that has a 
length that is at least 25% of the length of SEQ ID NO:7. 

17. A protein comprising an amino acid sequence selected from the group 
consisting of: 
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(a) the amino acid sequence of SEQ ID NO:8; 

(b) a fragment of the amino acid sequence of SEQ ID NO:8, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:8; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
nn296_2 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

18. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:9; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 597 to nucleotide 704; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone nq27_13 deposited under accession 
number ATCC 98715; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone nq27_13 deposited under accession number ATCC 98715; 

(e) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

(f) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:10, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:10; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:32; 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:32, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:32; 

(i) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the f polynucleotides specified in (a)-(h); and 

(j) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(h), and that has a 
length that is at least 25% of the length of SEQ ID NO:9. 
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19. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 10; 

(b) a fragment of the amino acid sequence of SEQ ID NO:10, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:10; and 

(c) the amino acid sequence of SEQ ID NO:32; 

(d) a fragment of the amino acid sequence of SEQ ID NO:32, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:32; and 

(e) the amino acid sequence encoded by the cDNA insert of clone 
nq27JL3 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

20. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:ll; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 11 from nucleotide 44 to nucleotide 475; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone pk65_4 deposited under accession 
number ATCC 98715; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone pk65_4 deposited under accession number ATCC 98715; 

(e) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:12; 

(f) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 12, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:12; 

(g) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(f); and 

(h) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(f), and that has a 
length that is at least 25% of the length of SEQ ID NO:ll. 
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21. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:12; 

(b) a fragment of the amino acid sequence of SEQ ID NO:12, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:12; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pk65_4 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

22. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

MO:13; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 13 from nucleotide 285 to nucleotide 590; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 408 to nucleotide 590; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone pk855_l deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone pk855_l deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone pk855_l deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone pk855_l deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:14; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ip NO:14, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:14; 

(j) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i); and 
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(k) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i) / and that has a 
length that is at least 25% of the length of SEQ ID MO:13. 

23. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:14; 

(b) a fragment of the amino acid sequence of SEQ ID NO: 14, the 
fragment comprising eight contiguous amino acids of SEQ ID NO: 14; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pk855_l deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

24. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 15 from nucleotide 193 to nucleotide 2223; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 1045 to nucleotide 2223; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone PL776j6 deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone PL776_6 deposited under accession number ATCC 98715; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone PL776_6 deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone PL776_6 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:16; 
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(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:16, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:16; 

(j) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i); and 

(k) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i), and that has a 
length that is at least 25% of the length of SEQ ID NO:15. 

25* A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:16; 

(b) a fragment of the amino acid sequence of SEQ ID NO:16, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:16; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
PL776_6 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

26. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:17; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 198 to nucleotide 1121; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 381 to nucleotide 1121; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone pm4_13 deposited under accession 
number ATCC 98715; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDMA insert of clone pm4_13 deposited under accession number ATCC 98715; 
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(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone pm4_13 deposited under accession number 
ATCC 98715; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone pm i _13 deposited under accession number ATCC 98715; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:18; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:18; 

(j) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formarnide, to any one of the polynucleotides specified in (a)-(i); and 

(k) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(i), and that has a 
length that is at least 25% of the length of SEQ ID NO:17. 

27. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:18; 

(b) a fragment of the amino acid sequence of SEQ ID NO:18, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:18; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pm4JL3 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 

28. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:19; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 19 to nucleotide 1953; 
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(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone pt326„4 deposited under accession 
number ATCC 98715; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone pt326_4 deposited under accession number ATCC 98715; 

(e) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

(f) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20, the fragment comprising eight contiguous 
amino acids of SEQ ID NO:20; 

(g) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 65 degrees C, or 4X SSC at 42 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(f); and 

(h) a polynucleotide that hybridizes under conditions at least as 
stringent as 4X SSC at 50 degrees C, or 6X SSC at 40 degrees C with 50% 
formamide, to any one of the polynucleotides specified in (a)-(f), and that has a 
length that is at least 25% of the length of SEQ ID NO:19. 

29. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:20; 

(b) a fragment of the amino acid sequence of SEQ ID NO:20, the 
fragment comprising eight contiguous amino acids of SEQ ID NO:20; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
pt326_4 deposited under accession number ATCC 98715; 

the protein being substantially free from other mammalian proteins. 
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FIGURE 1A 



: Hindiif 1 
Pvul! 5284 \ 



Ndel 5160 \ 

BamHI 345 




BamMI 2772 Clal 2666 



Plasmid name: pED6dpc2 
Plasmid size: 5374 bp 



Comments/References: pED6dpc2 is derived from pED6dpc1 by insertion of a new 
poiylinker to facilitate cDNA cloning. SST cDNAs are cloned between EcoRI and Notl. 
pED vectors are described in Kaufman et al.{1991), NAR 19: 4485-4490. 
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FIGURE IB 




Sap! 2135 



Plasmid name: pNOTs 
Plasmid size: 4529 bp 



f 

Comments/References: pNOTs is a derivative of pMT2 (Kaufman et al,1989. Mol.CelLBioL9:i741-l750). 
OHFR was deleted and a new poiytinker was inserted between EcoRi and Hpal. M13 origin 
of replication was inserted in the Cial site. SST cDNAs are cloned between EcoRI and 
No\\ 



2/2 



WO 99/50405 



PCT/US99/06946 



SEQUENCE LISTING 

<110> Jacobs, Kenneth 
McCoy, John M. 
LaVallie, Edward R. 
Collins-Racie, Lisa A. 
Evans , Cheryl 
Merberg, David 
Treacy, Maurice 
Agostino, Michael J. 
Steininger II, Robert J. 
Genetics Institute, Inc. 

<12 0> SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM 

<130> GI 6067A 

<140> 
<141> 

<160> 32 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2360 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (2343) 

<400> 1 

ccacttcccc cttttgttaa ttaaaactaa gaagtcggaa tgggaacgag gtgcccagct 60 

cccgtggaga aagcttaagg acaccacgcc agtgctttcc tgccttcctt ccgagatgga 120 

aagaggagct cctagctcac ttaagccggg gtagggctgg ttctcctttc cgagccaaaa 180 

tcccaggcga tggtgaatta tgaacgtgcc acaccatgaa gctcttgtgg caggtaactg 240 

tgcaccacca cacctggaat gccatcctgc tcccgttcgt ctacctcacg gcgcaagtgt 300 

ggattctgtg tgcagccatc gctgctgccg cctcagccgg gccccagaac tgcccctccg 360 

tctgctcgtg cagtaaccag ttcagcaagg tggtgtgcac gcgccggggc ctctccgagg 420 

tcccgcaggg tattccctcg aacacccggt acctcaacct catggagaac aacatccaga 480 

tgatccaggc cgacaccttc cgccacctcc accacctgga ggtcctgcag ttgggcagga 540 

actccatccg gcagattgag gtgggggcct tcaacggcct ggccagcctc aacaccctgg 600 

agctgttcga caactggctg acagtcatcc ctagcggggc ctttgaatac ctgtccaagc 660 

tgcgggagct ctggcttcgc aacaacccca tcgaaagcat cccctcttac gccttcaacc 720 

gggtgccctc cctcatgcgc ctggacttgg gggagctcaa gaagctggag tatatctctg 780 

agggagcttt tgaggggctg ttcaacctca agtatctgaa cttgggcatg tgcaacatta 840 

aagacatgcc caatctcacc cccctggtgg ggctggagga gctggagatg tcagggaacc 900 

acttccctga gatcaggcct ggctccttcc atggcctgag ctccctcaag aagctctggg 960 

tcatgaactc acaggtcagc ctgattgagc ggaatgcttt tgacgggctg gcttcacttg 1020 

tggaactcaa cttggcccac aataacctct cttctttgcc ccatgacctc tttaccccgc 1080 

tgaggtacct ggtggagttg catctacacc acaacccttg gaactgtgat tgtgacattc 1140 

tgtggctagc ctggtggctt cgagagtata tacccaccaa ttccacctgc tgtggccgct 1200 

gtcatgctcc catgcacatg cgaggccgct acctcgtgga ggtggaccag gcctccttcc 1260 

agtgctctgc ccccttcatc atggacgcac ctcgagacct caacatttct gagggtcgga 1320 

tggcagaact taagtgtcgg actcccccta tgtcctccgt gaagtggttg ctgcccaatg 13 80 

ggacagtgct cagccacgcc tcccgccacc caaggatctc tgtcctcaac gacggcacct 1440 

tgaacttttc ccacgtgctg ctttcagaca ctggggtgta cacatgcatg gtgaccaatg 1500 

ttgcaggcaa ctccaacgcc tcggcctacc tcaatgtgag cacggctgag cttaacacct 1560 
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ccaactacag cttcttcacc acagtaacag tggagaccac ggagatctcg cctgaggaca 1620 

caacgcgaaa gtacaagcct gttcctacca cgtccactgg ttaccagccg gcatatacca 1680 

cctctaccac ggtgctcatt cagactaccc gtgtgcccaa gcaggtggca gtacccgcga 1740 

cagacaccac tgacaagatg cagaccagcc tggatgaagt cacgaagacc accaagatca 1800 

tcattggctg ctttgtggca gtgattgtgc tagctgccgc catgttgatt gtcttctata 1860 

aacttcgtaa gcggcaccag cagcggagta cagtcacagc cgcccggact gttgagataa 1920 

cccaggtgga cgaagacatc ccagcagcaa catccgcagc agcaacagca gctccgtccg 1980 

gtgtatcagg tgagggggca gtagtgctgc ccacaattca tgaccatatt aactacaaca 2040 

cctacaaacc agcacatggg gcccactgga cagaaaacag cctggggaac tctctgcacc 2100 

ccacagtcac cactatctct gaaccttata taattcagac ccataccaag gacaaggtac 2160 

aggaaactca aatatgactc cctcccccaa aaaacttata aaatgcaata gaatgcacac 2220 

aaagacagca acttttgtac agagtgggga gagacttttt cttgtatatg cttatatatt 2280 

aagtctatgg gctggttaaa aaaaacagat tatattaaaa tttaaagaca aaaagtcaaa 2340 

acnaaaaaaa aaaaaaaaaa 2360 

<210> 2 
<211> 653 
<212> PRT 

<213> Homo sapiens 
<40O> 2 

Met Lys Leu Leu Trp Gin Val Thr Val His His His Thr Trp Asn Ala 
15 10 15 

He Leu Leu Pro Phe Val Tyr Leu Thr Ala Gin Val Trp lie Leu Cys 
20 25 30 

Ala Ala He Ala Ala Ala Ala Ser Ala Gly Pro Gin Asn Cys Pro Ser 
35 40 45 

Val Cys Ser Cys Ser Asn Gin Phe Ser Lys Val Val Cys Thr Arg Arg 
50 55 60 

Gly Leu Ser Glu Val Pro Gin Gly lie Pro Ser Asn Thr Arg Tyr Leu 
65 70 75 ' ~ 80 

Asn Leu Met Glu Asn Asn He Gin Met He Gin Ala Asp Thr Phe Arg 
85 90 95 

His Leu His His Leu Glu Val Leu Gin Leu Gly Arg Asn Ser He Arg 
100 105 no 

Gin He Glu Val Gly Ala Phe Asn Gly Leu Ala Ser Leu Asn Thr Leu 
115 120 125 

Glu Leu Phe Asp Asn Trp Leu Thr Val He Pro Ser Gly Ala Phe Glu 
130 135 140 

Tyr Leu Ser Lys Leu Arg Glu Leu Trp Leu Arg Asn Asn Pro lie Glu 
145 150 155 160 

Ser He Pro Ser Tyr Ala Phe Asn Arg Val Pro Ser Leu Met Arg Leu 
165 t 170 175 

Asp Leu Gly Glu Leu Lys Lys Leu Glu Tyr He Ser Glu Gly Ala Phe 
180 las 190 

Glu Gly Leu Phe Asn Leu Lys Tyr Leu Asn Leu Gly Met Cys Asn He 

200 205 
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Lys Asp Met Pro Asn Leu Thr Pro Leu Val Gly Leu Giu Glu Leu Glu 
210 215 220 

Met Ser Gly Asn His Phe Pro Glu lie Arg Pro Gly Ser Phe His Gly 
225 230 235 240 

Leu Ser Ser Leu Lys Lys Leu Trp Val Met Asn Ser Gin Val Ser Leu 
245 250 255 

lie Glu Arg Asn Ala Phe Asp Gly Leu Ala Ser Leu Val Glu Leu Asn 
260 265 270 

Leu Ala His Asn Asn Leu Ser Ser Leu Pro His Asp Leu Phe Thr Pro 
275 280 285 

Leu Arg Tyr Leu Val Glu Leu His Leu His His Asn Pro Trp Asn Cys 
290 295 300 

Asp Cys Asp He Leu Trp Leu Ala Trp Trp Leu Arg Glu Tyr He Pro 
305 310 315 320 

Thr Asn Ser Thr Cys Cys Gly Arg Cys His Ala Pro Met His Met Arg 
325 330 335 

Gly Arg Tyr Leu Val Glu Val Asp Gin Ala Ser Phe Gin Cys Ser Ala 
340 345 350 

Pro Phe He Met Asp Ala Pro Arg Asp Leu Asn He Ser Glu Gly Arg 
355 360 365 

Met Ala Glu Leu Lys Cys Arg Thr Pro Pro Met Ser Ser Val Lys Trp 
370 375 380 

Leu Leu Pro Asn Gly Thr Val Leu Ser His Ala Ser Arg His Pro Arg 
385 390 395 400 

He Ser Val Leu Asn Asp Gly Thr Leu Asn Phe Ser His Val Leu Leu 
405 410 415 

Ser Asp Thr Gly Val Tyr Thr Cys Met Val Thr Asn Val Ala Gly Asn 
420 425 430 

Ser Asn Ala Ser Ala Tyr Leu Asn Val Ser Thr Ala Glu Leu Asn Thr 
435 440 445 

Ser Asn Tyr Ser Phe Phe Thr Thr Val Thr Val Glu Thr Thr Glu He 
450 455 460 

Ser Pro Glu Asp Thr Thr Arg Lys Tyr Lys Pro Val Pro Thr Thr Ser 
465 470 475 480 

Thr Gly Tyr Gin Pro Ala Tyr Thr Thr Ser Thr Thr Val Leu He Gin 
485 490 495 

Thr Thr Arg Val Pro Lys Gin Val Ala Val Pro Ala Thr Asp Thr Thr 
500 505 510 

Asp Lys Met Gin Thr Ser Leu Asp Glu Val Met Lys Thr Thr Lys lie 
515 520 525 
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He He Gly Cys Phe Val Ala Val 
530 535 

He Val Phe Tyr Lys Leu Arg Lys 
545 550 

Thr Ala Ala Arg Thr Val Glu He 
565 

Ala Ala Thr Ser Ala Ala Ala Thr 
580 

Glu Gly Ala Val Val Leu Pro Thr 
595 600 

Thr Tyr Lys Pro Ala His Gly Ala 
610 615 

Asn Ser Leu His Pro Thr Val Thr 
625 630 

Gin Thr His Thr Lys Asp Lys Val 
645 



He Val Leu Ala Ala Ala Met Leu 
540 

Arg His Gin Gin Arg Ser Thr Val 
555 560 

He Gin Val Asp Glu Asp lie Pro 
570 575 

Ala Ala Pro Ser Gly Val Ser Gly 
585 590 

He His Asp His He Asn Tyr Asn 
605 

His Trp Thr Glu Asn Ser Leu Gly 
620 

Thr He Ser Glu Pro Tyr He He 
635 640 

Gin Glu Thr Gin He 
650 



<210> 3 

<211> 1558 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gcgaagctga taagccaata agagtgctgt ctagaacgca tttgcaaacc gttaagagta 60 

gcttcttttt gaggcctgga cttctgattc atctatagaa gtctattatg attccatcaa 120 

ttgcatagct tgtgtacctg gtagaaattg tgtcttggaa tgaccctttc gagttattga 180 

catggctctg atgaatagaa catgagcccc aaaactaaat ccaaaaggaa ttttctatct 240 

ttcatccccc acatgtggca agacaagttg gccctttctt acccagaggt cttttgtgtg 300 

actgcatctt tctcctccgt tctccattgt gtgctttcca ttttgtcttt agtgcctata 360 

ctgttaggtg ttttcttcac tggcattcac aaatttaagc cattgctgcc tcattagcct 420 

tgtattttgt gtgcatatca tatatccaga cctgtatgtt cactttaagc attcttatat 480 

cacactgtct cctcatctac catatggtaa atgaggagac agtgtcttat atcacactgt 540 

ctcctcatct accatatggt aaatgttaaa actccacatt tgtctgcatc agggaaaatg 600 

catgggcaca catcctccct ccctccctct ctgctctcct cccttccttc aggcctctta 660 

gcattgtttg ttttcccatt tctgatacta ctactccatg ctgaagattt gccatattac 720 

tattttggaa acattgagtg atagaactcc tagaaaattt gcaaagaaat gttacatact 780 

gtatatcaaa ctctcagatt ctagtgttga aaaagtagcc tatactttgc tattacttat 840 

acctgctgcc atagaaaaaa ataagtttat tcatgacaca tttacatttg atcataaata 900 

aaagaaaaaa gggcaccttt ttggagttag tcatggtagt cattagtgat atttctgaac 960 

agttcttaat ttaaaatact tcaaaggaag taaaggtcat ggcttagctg aaggaaatgc 1020 

tccagaaatt ggactgtgta aaccatcagt: acaataatac gctgtgtatg tatgtgtata 1080 

taaaatgaga attatggcat aattggagca tttgcattaa tcaacaaact cacattgaga 1140 

caaaacttag ttttacagct gtcttgatta aagccaagtg ttccatgttg ctgtgaagaa 1200 

tagcctcttt caaatacttt ggaaagtagt tacttggaaa cttgtaaagg tattacattt 1260 

ttatatttaa acacctatag agatcttcaa ttccttgagt ctgagcttgt gggtggaatt 1320 

ctaaatttgt atcataatct gtcttttgtg aaacattttg aaaatatgta tatataatat 1380 

tgtatatgca aattgtgttg tttcacttgt aaagggaaaa ggcttatttt tctttatatt 1440 

tctgataact tgttttgcat atgaccagca ctgactgaaa ggcatgtgta gctgcaaaca 1500 

ctgttgcttt ttttgtgaaa tgaaaataaa agtatttaaa tacaaaaaaa aaaaaaaa 1558 

<210> 4 
<211> 71 
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<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Ser Pro Lys Thr Lys Ser Lys Arg Asn Phe Leu Ser Phe lie Pro 
15 10 15 

His Met Trp Gin Asp Lys Leu Ala Leu Ser Tyr Pro Glu Val ."'he Cys 
20 25 30 

Val Thr Ala Ser Phe Ser Ser Val Leu His Cys Val Leu Ser He Leu 
35 40 45 

Ser Leu Val Pro He Leu Leu Gly Val Phe Phe Thr Gly He His Lys 
50 55 60 

Phe Lys Pro Leu Leu Pro His 
65 70 



<210> 5 

<211> 2637 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ccttgagcct gttcctttac ctcaccaggc 
ctcttagggc ccctttcaat tctagtgttc 
gagaagaact ttcctgaagt tcgaatatat 
accttagagt tattcccatg ggaggatgaa 
ttgggcctgt actgctctaa ttctgaactt 
gggacggtct cctaacataa tgtttatatc 
gtttcccatc agaatttggg ttccagttca 
attcatgcta ttgtcagttt ttgcttcttt 
acttctttct ttctaaagtc aatgagtaaa 
gtgctcccag gctggagtgc atgactcatc 
agaaaagggc aagaatgttg tgttgtcttt 
tctgaattat tggcactaag ggcacatcct 
aagcagagat tctttttagg gagtccagag 
ccaatagaga tgtcagaagg gtgactgtta 
tttgagcatt gtctactcag agtgattacc 
tgcaactact tgcctcatgt tggactaaag 
ggaggcacca ggcattgtta gaaaagcagt 
gatcgggtgg acttgggttt atgtcctcgc 
ttcctcattt ctaaagtaga gacagggatg 
ttaaatggat gagaatcaga agtatttatt 
aactgaaaac aaaggtgata atttatgtgg 
atgaaggatg gaatcaaatt aagtgcaaac 
agggcaaagg tgtatggaga tgggaattca 
ggagttggga gagctgctga ccaaatgtgg 
ttggattcag gcaagctatt ttaacccagg 
atcttccagt tggggaggtg ttttgggaca 
agagaaagaa gagaagtatt tgcattttgt 
cttctatatg gtaatacatt gcaaaggaga 
tgtgtgtctt cctctcatat gcataaagga 
ttgcattgaa gaatttttaa ggtgttattc 
agctgaaaaa gatatgcatt tttttccttc 
tcaaaatttt tgccctggag gatattaggc 
aaggtagcag atgttcacat tggtagctag 
agttagagat gggggtactg agctggaggt 



ctcattttcc acatctgttg aacaatctgg 60 
tggaggtcca tgaactgttt ttgtgtagta 120 
aatatatgac agaggaaatc actgttatga 180 
gactgacatt gcttccttgc acaccaggac 240 
aatttcctgt gtggctcttg cttggttcat 300 
ccgtggctca ggcatagaga cccactcatg 360 
ccactccaca tacacacacc aaggctaatg 420 
gatattgttc tgaaggtagt ttttattgat 480 
gttatggcaa gacattggaa agacattgca 540 
tcctggcccc tccaggaaac cggagaaggc 600 
gggctgtgga gttcatgatt atggacactt 660 
gcttagcatg gaggctggac agctctggga 72 0 
catttttttc ttctttctta tcccctcttt 780 
gggttgatac attatgtggt gctctgtttt 840 
aaggtggagc tcattatgtt acctggcacc 900 
tggaaagact cctattggga attcactcag 960 
atcgcagagc acttaagagc tgagccttgt 1020 
actgtcgctt cctagattta tgacctcagt 1080 
gagggtgggg agaatatgaa ttaatttgga 1140 
tggaattggt gtgaattgga gcacatgggg 12 00 
tgtagatgaa ctaagaataa tgaaaacagc 1260 
atataaagga aatggactgg taaaagctcg 13 20 
ggttggaaag aggctgctga tacaaggact 13 80 
aaatgaagat tttttaaaaa tgggcgctgg 1440 
actcacttta attcagtcaa aatatttctt 1500 
ggaaaaaaag ggagataaga ataccatccc 1560 
tctctagagt tgcagatata tgctgtatct 1620 
tacagtgaag aaatgaagta aaataaaatg 1680 
agacaataca tgttggtaac atggatttgc 1740 
atttaatctt tatatagaca gaatttactg 1800 
tgtggctgtt ttttgaggat gctgaatcct 1860 
aatgaggtga ttttttattt attatttttt 1920 
ggcacaaacg caggttgccc aaggaagtaa 19 80 
gcaggataac gaggcgagga atgggacaaa 2040 
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gggaatatta 
ccaggtgcgg 
acctaaggtc 
aattagctgg 
gaatcgcttg 
agcctgggtg 
tatggtttat 
agccaagtca 
ttctcacaaa 
atccttatat 



tgaagtagca 
tggctcacgc 
aggagttcga 
gcgtgtggcg 
accccaggag 
acagagtgag 
tcattgggaa 
ttgcagacct 
ttctaccatt 
tttgatgaaa 



ggatttagtg 
ccctaatcct 
gaccagcctg 
cgtgcctgta 
gtggaggttg 
acaccgtctc 
gagcacctag 
ttctctccta 
atgatcatag 
ggtaaccttc 



aaaggaagca 
agcactttgg 
accaacaccc 
gtcccagcta 
cagtgcgccg 
aaaaacaata 
tgttggcatg 
ctmtgtgttc 
ttgtattcat 
acttccaata 



tataaaaatg 
gaggcctagg 
gtctctacta 
ctcaggaggc 
agattgcgtc 
aaaattaaaa 
agtgtaaaaa 
tggtcttcat 
ctccttgagt 
ttaaaaaaaa 



gacataaggg 
caggtggatc 
aaagtacaaa 
tgaggcagga 
actgcactcc 
aaaatggaca 
tgttcttagc 
atatttttct 
cagctaaata 
aaaaaaa 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2637 



<210> 6 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Lys Thr Asp He Ala Ser Leu His Thr Arg Thr Trp Ala Cys Thr 
15 10 15 

Ala Leu He Leu Asn Leu lie Ser Cys Val Ala Leu Ala Trp Phe Met 
20 25 30 

Gly Arg Ser Pro Asn He Met Phe He Ser Arg Gly Ser Gly He Glu 
35 40 45 

Thr His Ser Trp Phe Pro He Arg He Trp Val Pro Val His His Ser 
50 55 60 



Thr Tyr Thr His Gin Gly 
65 70 



<210> 7 

<211> 4398 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (4338) 

<400> 7 

ggcgctggtg gctgcggcgg cggcggcggc agcggcgctc gagcggttcc tgtcagggtc 60 

agccggcggg ccccctgggt ggtccacctg caaatcgcgg agcggcgccc cagggatcga 120 

tggcgatgaa ctataacgcg aaggatgaag tggacggtgg gcccccgtgt gctccggggg 180 

gcaccgcgaa gactcggaga ccggataaca cggccttcaa acagcaacgg ctgccagctt 240 

ggcagcccat ccttacggct ggcacggtgc tacctatttt cttcatcatc ggtctcatct 300 

tcattcccat cggcattggc atttttgtca cctccaacaa catccgcgag atcgagattg 360 

attataccgg aacagagcct tccagtccct gtaataaatg tttatctccg gatgtgacac 420 

cttgcttttg taccattaac ttcacactgg aaaagtcatt tgagggcaac gtgtttatgt 480 

attatggact gtctaatttc tatcaaaacc atcgtcgtta cgtgaaatct cgagatgata 540 

gtcaactaaa tggagattct agtgctttgc ttaatcccag taaggaatgt gaaccttatc 600 

gaagaaatga agacaaacca attgctcctt gtggagctat tgccaacagc atgtttaatg 660 

atacattaga attgtttctc attggcaatg attcttatcc tatacctatc gctttgaaaa 720 

agaaaggtat tgcttggtgg acagataaaa atgtgaaatt cagaaatccc cctggaggag 780 

acaacctgga agaacgattt aaaggtacaa caaagcctgt gaactggctt aaaccagttt 840 

acatgctgga ttctgaccca gataataatg gattcataaa tgaggatttt attgtttgga 900 

tgcgtactgc agcattacct acttttcgca agttgtatcg tcttatagaa aggaaaagtg 960 

atttacatcc aacattacca gctggccgat actctttgaa tgtcacatac aattaccctg 1020 
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tacattattt tgatggacga aaacggatga 
aaaatccatt tttggggatt gcttacatcg 
ttgtactgct agtaattaat cataaatata 
tttaatttta tattatgaaa gcaaatcatc 
cctagctttc gaatgctgat atctggttag 
tctaaaaaaa agcagtcttt gttgtttgct 
tgacgggtat aaaaaaatta gctatattga 
attctaatgt ttgcttttta ttcggacagg 
acttgtgtct actgcttaaa tctttatgct 
ctgtatatgt gtatgtattt tgtggagaag 
ttgctaacct ttcaggattc cgagaaagat 
catcattttc atgtgtataa atgcttaaag 
cagtttatcc agtacagtta tttgtcaagc 
ttgtctggca taagaattaa tgctcatgtc 
aattaaaaaa ttcctctttc cgtttggttg 
ggaatgtttt tatatgtaat catttccctt 
ttcaggtgat atgggaaaac tgcttggcag 
catgtaaatg cttaatggag actgttttca 
cttcagcttt ggaaatactc aggctgtttt 
actaatattc cagaaatttt tgaaagcagt 
ttttgcatat aggtcaaata gcaatgtgta 
gttttggttg gttttcctaa gacatacttt 
attatgggga ttttggagta tgtacatgat 
tttatttttt actaatgaat tattttaata 
tgctatggaa cttattctta aagttctagt 
atgtattaat cagaggtggc ttaattacat 
ctgaaataac tcatgtttgt gtagaagaat 
cagttatatt ttaaatcaaa aggtctgggt 
tttaaaaaac aaaaaacaat gtaatggtta 
ttatataaca tatgaaattc agttaaaaga 
aaaaatatta cttgcacgaa gtacttgatg 
gtgtgttctt tgtctaaaaa tagtctgttt 
aagattgtcc ctcttgcact tggcagttta 
gtttctgcca ctctttccag attgagtctg 
ctctttttct aattggggca cagaggatgt 
cactttctca tctgtggaga agatggaatc 
acaagtccat tgatggccta agttcctcct 
ctgttgctaa atatgaagag tatggcacat 
ggaagacatt tctttttctt ataggcttta 
attgtgttgc attttaaatg cacccagctt 
ttcaggaact attttgaatt atgaaaagat 
aagctaagaa atttcattga aatcataagg 
gtgtactact aattaataga aaatcattca 
gtttatttgc tagtgagttt ctttgtaacg 
gtatgtccta tgttaataaa aatgaacaaa 
aaaataacaa tgctccagtg ttgttgctta 
gtgagtccag gttttaactt cttcatgcct 
ttaattcaca tttgtattta ttaatcactg 
aagtggtcag attatcatct gagttacagt 
gggaatgtcc agtaaacaga gaggtaagca 
aaaaacctga ctagtgagga ggagcagctg 
tgctacctct cctgtcttca tagaaccatk 
gttggttaag ctgtaaggtt ttgttctttg 
gggagtaggg aagtggtcct tttacaagaa 
tggatgatct agtaaagtgt tttagaaccc 
taaaaatcac aattctgnta atgtcatttg 
caaaaaaaaa aaaaaaaa 



tcttgagcac tatttcatgg atgggaggaa 1080 
ctgttggatc catctccttc cttctgggag 1140 
gaaacagtag taatacagct gacattacca 1200 
tgcatgtgca tcaaggccag tcctattcaa 1260 
tatgtcattt tgaagttggc acataacttt 1320 
tcttccctac ggatgacttc taaaaatata 1380 
tcatatcaac actgtaactg ctgaaatggc 1440 
ccacatgatg catagagcct ctttcatgtg 1500 
gtgttgatga tattatattg acatatgaag 1560 
agggattaca agatgtatga gtataatgac 1620 
gaagaaagac catatctaaa taatacactt 1680 
taccatcttt gttgaggtgg ttcatgtatc 1740 
ttagctttga tttcaaagga cacgcttacc 1800 
tgcagtggtt gggtaggtcc tgcttaggag 1860 
aatgttgcag tcaggaaccc caactcactt 1920 
gaagcttata ctttataagg gaagaaagaa 1980 
accttcatct tctgcctcaa ctgtaaacca 2040 
ttcttgtgat atttaacatt cagaaaatta 2100 
tattctgcag gtaagtgttt tgacttaagt 2160 
aaccttaatt tcctatgtat ttcattccac 2220 
tgcacattct ctttagttaa ggcaccaatt 2280 
aaaaagatgt tctataaatt tcctagttaa 2 340 
aaattataat acgtatatgg ttgaagttat 2400 
ttccttattg aataaatgct gtaacttgtt 2460 
taaaaataat ttttccacat gcatgaaaat 252 0 
tgaaattgct tttttgttgt tgttttttta 2580 
gcctgtttac tcagagttta tattttcctt 2640 
aatgtatact tttgattaat atatactttt 2700 
atagtagaaa tgtgccacac ttttcaagtt 2760 
atgtgtgttt cataatgact tttaactggt 2820 
tatggttatc ctgaaatttc ggagtatttg 2880 
tgtcagtcct tcagaatatt atttattctg 2940 
ttttcgggga tacattgttg ggggagaggg 3000 
tgctgtttaa ggaggactac catcctgcaa 3060 
cgctaaagaa aagttgaaga gccctttcag 3120 
ttaaaataca tttggagttt tatctgtttt 31£0 
gttttctgct gtttgatctc taaggaactc 3240 
tcatatagtc tctgtgaagc atggggggag 3300 
tgctcaaatg tcatagtctc ctttcaaaga 3360 
aagtagaaga cattgaagga tgcattaatt 3420 
tcccaattga aaaaattatt caacaagtaa 3480 
cagtttaagc ataaattgat aaaaatagct 3 54 0 
accaagagaa gagtcaagtg aatatcgttt 3 600 
ttgattttat taaatgataa tatttggtta 3 660 
attaattttg ctatgttcag gtgtcttgat 3720 
catttagcac taaattttaa cacagggtca 3780 
ggatgggata aaatgtaatt cattgttaaa 3840 
tgacaacatt aaccatttgt tcttaccagg 3900 
tagactggct aagtttggta ttagatcaag 3 960 
tgatggaaat aatgaagtgg ggtacacagg 402 0 
agagataggg tcagtgaatg cggttcagcc 4080 
gccttagaat tattgtatga cacgtttttt 4140 
tgaacatggg tattttgagg ggagggtgga 4200 
ttttgatgca taagtgtcta ttgtagggtt 4260 
ctttttatcc catgcaccat tcagtaaaca 4320 
gaacttcaaa ataaatatct tgtctaaaaa 4380 

4398 



<210> 8 
<211> 361 
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<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Ala Met Asn Tyr Asn Ala Lys Asp Glu Val Asp Gly Gly Pro Pro 
1 5 10 ^ 15 

Cys Ala Pro Gly Gly Thr Al- ^ys Thr Arg Arg Pro Asp Asn Thr Ala 
20 25 30 

Phe Lys Gin Gin Arg Leu Pro Ala Trp Gin Pro He Leu Thr Ala Gly 
35 40 45 

Thr Val Leu Pro He Phe Phe He He Gly Leu He Phe He Pro He 
50 55 60 

Gly He Gly He Phe Val Thr Ser Asn Asn He Arg Glu He Glu He 
65 70 75 " 80 

Asp Tyr Thr Gly Thr Glu Pro Ser Ser Pro Cys Asn Lys Cys Leu Ser 
85 90 95 

Pro Asp Val Thr Pro Cys Phe Cys Thr He Asn Phe Thr Leu Glu Lys 
100 105 no 

Ser Phe Glu Gly Asn Val Phe Met Tyr Tyr Gly Leu Ser Asn Phe Tyr 
115 120 125 

Gin Asn His Arg Arg Tyr Val Lys Ser Arg Asp Asp Ser Gin Leu Asn 
130 135 140 

Gly Asp Ser Ser Ala Leu Leu Asn Pro Ser Lys Glu Cys Glu Pro Tyr 
145 150 155 160 

Arg Arg Asn Glu Asp Lys Pro He Ala Pro Cys Gly Ala He Ala Asn 
165 170 175 

Ser Met Phe Asn Asp Thr Leu Glu Leu Phe Leu He Gly Asn Asp Ser 
18° 185 190 

Tyr Pro He Pro He Ala Leu Lys Lys Lys Gly He Ala Trp Trp Thr 
195 200 205 

Asp Lys Asn Val Lys Phe Arg Asn Pro Pro Gly Gly Asp Asn Leu Glu 
210 215 220 

Glu Arg Phe Lys Gly Thr Thr Lys Pro Val Asn Trp Leu Lys Pro Val 
225 230 235 * 240 

Tyr Met Leu Asp Ser Asp Pro Asp Asn Asn Gly Phe He Asn Glu Asp 
24 5 250 255 

Phe He Val Trp Met Arg Thr Ala Ala Leu Pro Thr Phe Arg Lys Leu 
260 265, 270 

Tyr Arg Leu lie Glu Arg Lys Ser Asp Leu His Pro Thr Leu Pro Ala 
275 280 285 

Gly Arg Tyr Ser Leu Asn Val Thr Tyr Asn Tyr Pro Val His Tyr Phe 
290 295 300 
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Asp Gly Arg Lys Arg Met He Leu Ser Thr He Ser Trp Met Gly Gly 
305 310 315 320 

Lys Asn Pro Phe Leu Gly He Ala Tyr He Ala Val Gly Ser lie Ser 
325 330 335 

Phe Leu Leu Gly Val Val Leu Leu Val He Asn His Lys Tyr Arg Asn 
340 345 350 

Ser Ser Asn Thr Ala Asp He Thr He 
355 360 



<210> 9 

<211> 777 

<212> DNA 

<213> Homo sapiens 



<400> 9 

agaaattata gagtgagtgt attttccttg 

tggtaaagtt gattacatca aagaatcttt 

tggtctctct aggtttttat ataaatagtg 

tttagaactc attttcctca gtagagacta 

tatcggttct taattaaatt atccaaaacg 

tcttttgtgt ttgcttttta aaataaagtg 

aattatctac atgatgttta tagttcctgt 

cagtacattt agttttataa gagtcatgaa 

aagtctagaa tgtgtgtatt cacaatggtg 

aagtgtctca gaaaacctgg acagttcgct 

agatgattct gtaccctagt atatcttttt 

catattctgc acgatctgta tatagtacat 

tttttttaaa aactgggagg tcaataaaat 



tgtatatatt aaacacaccc atacaaacat 60 

agcttatctt ttgaagctac tggatattat 120 

aaatctgaat tactgaaaac catgttaatt 180 

gtgatgcatt agcttctggg aacaaacttg 2 40 

gaggcattta aacacttgga tttacaccag 300 

ctcgtatttg tattctccat attttggagt 3 60 

ggtttttcac ccaagaagca gaatctcatt 420 

gctaaatcct tgggctatgt cagaggcaca 480 

tatgtacatt ttgtgccttg attcacttag 540 

tctacacaag aattttatat gtatttatga 600 

gggcatggac taatttgtat ctgtttaact 660 

caaacttaga ggtgtgacct ta*aatttaac 720 

ttaaactgct taaaaaaaaa aaaaaaa 777 



<210> 10 
<211> 36 
<212> PRT 

<213> Homo sapiens 



<400> 10 

Met Lys Met He Leu Tyr Pro Ser 
1 5 

Leu Tyr Leu Phe Asn Ser Tyr Ser 
20 



He Ser Phe Trp Ala Trp Thr Asn 
10 15 

Ala Arg Ser Val Tyr Ser Thr Ser 
25 30 



Asn Leu Glu Val 
35 



<210> 11 

<211> 1378 

<212> DNA 

<213> Homo sapiens 

<400> 11 

ctccgctggc aacggcgccg ctccccgctc 
ggccttctgc tacatgctgg cgctgctgct 
gcacattata gcatttgatg agctgaagac 
taccctgaat ccccttgtac tcccagagta 



ctcctcccca gccatggcgt tcacgttcgc 60 
cactgccgcg ctcatcttct tcgccatttg 120 
tgattacaag aatcctatag accagtgtaa 180 
cctcatccac gctttcttct gtgtcatgtt 240 
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tctttgtgca gcagagtggc ttacactggg tctcaatatg cccctcttgg catatcatat 3 00 
ttggaggtat atgagtagac cagtgatgag tggcccagga ctctatgacc ctacaaccat 3 60 
catgaatgca gatattctag catattgtca gaaggaagga tggtgcaaat tagcttttta 420 
tcttctagca tttttttact acctatatgg catgatctat gttttggtga gctcttagaa 480 
caacacacag aagaattggt ccagttaagt gcatgcaaaa agccaccaaa tgaagggatt 540 
ctatccagca agatcctgtc caagagtagc ctgtggaatc tgatcagtta ctttaaaaaa 600 
tgactcctta ttttttaaat gtttccacat ttttgcttgt ggaaagactg ttttcatatg 660 
ttatactcag ataaagattt taaatggtat tacgtataaa ttaatataaa atgattacct 720 
ctggtgttga caggtttgaa cttgcacttc ttaaggaaca gccataatcc tctgaatgat 780 
gcattaatta ctgactgtcc tagtacattg gaagcttttg tttataggaa cttgtagggc 840 
tcattttggt ttcattgaaa cagtatctaa ttataaatta gctgtagata tcaggtgctt 900 
ctgatgaagt gaaaatgtat atctgactag tgggaaactt catgggtttc ctcatctgtc 960 
atgtcgatga ttatatatgg atacatttac aaaaataaaa agcgggaact ttcccttcgc 1020 
ttgaatatta tccctgtata ttgcatgaat gagagatttc ccatatttcc atcagagtaa 1080 
taaatatact tgctttaatt cttaagcata agtaaacatg atataaaaac atatgctgaa 1140 
ttacttgtga agaatgcatt taaagctatt ttaaatgtgt ttttatttgt aagacattac 1200 
ttattaagaa attggttatt atgcttactg ttctaatctg gtggtaaagg tattcttaag 1260 
aatttgcagg tactacagat tttcaaaact gaatgagaga aaattgtata accatcctgc 1320 
tgttccttta gtgcaataca ataaaactct gaaattaaga ctcaaaaaaa aaaaaaaa 1378 

<210> 12 

<211> 144 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Ala Phe Thr Phe Ala Ala Phe Cys Tyr Met Leu Ala Leu Leu Leu 
IS 10 15 

Thr Ala Ala Leu lie Phe Phe Ala lie Trp His He lie Ala Phe Asp 
20 25 3 0 

Glu Leu Lys Thr Asp Tyr Lys Asn Pro He Asp Gin Cys Asn Thr Leu 
35 40 45 

Asn Pro Leu Val Leu Pro Glu Tyr Leu He His Ala Phe Phe Cys Val 
50 55 60 

Met Phe Leu Cys Ala Ala Glu Trp Leu Thr Leu Gly Leu Asn Met Pro 
65 70 75 80 

Leu Leu Ala Tyr His He Trp Arg Tyr Met Ser Arg Pro Val Met Ser 
85 90 95 

Gly Pro Gly Leu Tyr Asp Pro Thr Thr He Met Asn Ala Asp He Leu 
100 105 HO 

Ala Tyr Cys Gin Lys Glu Gly Trp Cys Lys Leu Ala Phe Tyr Leu Leu 
115 120 125 

Ala Phe Phe Tyr Tyr Leu Tyr Gly Met He Tyr Val Leu Val Ser Ser 
130 135 140 



<210> 13 

<211> 1450 

<212> DNA 

<213> Homo sapiens 

<400> 13 
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cgaaaatgtc aagagaaagg ccaaagattt 
tcccttaaac ttctttctgt caaattagat 
atctgtttat gtagaatgat tttgttaaga 
aaagaggact gtgcattttt gctatttaaa 
aaaaatgaca agttttacta gctaagtaga 
ttcattatct taagtacatt gtacatttct 
ggatggtact tgcatatggt gaattactac 
agtggaccaa cattgtggca tggcagcaaa 
tacaagagac cctggttggt ttttcgtttt 
atcagcaggg atggaaggag ggtagggaag 
gaagtgtcac atttaatatc agtttttttt 
gaagaaagag gaagtgttca cttttttaat 
cagtagttct gaggtattga tagcttgctt 
gcagggtgaa taactagggc atatattttt 
tctttggaat ttccggataa gttcaggaaa 
acttatccat ctcattacaa acaaaaacac 
gaatacggaa gtaaattttc ttctttcctg 
ttttgttttt acaaatcctt tgggtctaat 
aatgtctgca tcatttcttt agttatccag 
ccagcccatg acaggttttt gwacatgtta 
ttgtgtaagg aagactgatg tagatgttct 
acgacgtatt ttagttccac atcacaatga 
aggcacattt cttgttgttg ttgttgttgc 
actgtaattt gaatcagatc tgaaagaggt 
aaaaaaaaaa 



gtactttttc acttacaaag cactcctttt 60 
ttaatgagag agtactattt ttaaggagct 120 
gtaatgtaaa ctattattga gtagaggcct 180 
ggaatcacaa atgatcatac ttaagtgagc 240 
gaaataaatc tcaaatgcag cgctacaatt 300 
acagaacctg tgattattct cgcatgataa 3 60 
tgttgacagt ttccgcagaa atcctatttc 420 
tgccaacatt ttg-tggaata gcagcaaatc 480 
gttttctttg ttttttcccc cttctcctga 540 
ttatgaatta ctccttccag tagtagctct 600 
aaacatgatt ctagttaaat gtagaagaga 660 
acactgattt agaaattcga tgtcttatat 72 0 
tatttctgcc tttacgttga cagtgttgaa 780 
tttttttttg taagctgttt catgatgttt 840 
acattctgca tgttgtatct agtctgatgt 900 
acagactgca tttgtagctc tgtaatcctt 960 
actttgacat tgtagctata ctgtttccat 1020 
tctgtgagcc tacctatagc actggattaa 1080 
ttaactttaa aactgttgta aaagtgtaaa 1140 
aagaacttca ttgttcagtt ttcatgatta 12 00 
gtgctgtcct ggaccatgtt aattacactt 1260 
tttgtcccca gtgacccttt tatcctttct 1320 
agttcccctt tgcattgtat tgctttgaca 1380 
ccagaataaa atatattttg atattaaaaa 1440 

1450 



<210> 14 
<211> 102 
<212> PRT 

<213> Homo sapiens 



<400> 14 

Met Gin Arg Tyr Asn Phe His Tyr Leu Lys Tyr lie Val His Phe Tyr 
1 5 10 15 

Arg Thr Cys Asp Tyr Ser Arg Met lie Arg Met Val Leu Ala Tyr Gly 
20 25 30 

Glu Leu Leu Leu Leu Thr Val Ser Ala Glu lie Leu Phe Gin Trp Thr 
35 40 45 

Asn lie Val Ala Trp Gin Gin Met Pro Thr Phe Cys Gly lie Ala Ala 
50 55 60 

Asn Leu Gin Glu Thr Leu Val Gly Phe Ser Phe Cys Phe Leu Cys Phe 
65 70 75 80 

Phe Pro Leu Leu Leu Asn Gin Gin Gly Trp Lys Glu Gly Arg Glu Val 
85 90 95 



Met Asn Tyr Ser Phe Gin 
100 



<210> 15 

<211> 3546 

<212> DNA 

<213> Homo sapiens 

<4O0> 15 
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gtggaattgg acactggtgc cccaagcgag 
cagacagtct tacmgaagga agaggagagg 
caagagcccc ctgatgaagg aacctcaggg 
gctttgtcct cmatggatcc tgaagtgagg 
tcatccagtg ttgatggtca gtcagtgggg 
aagaatgggc cmgaagaaga ggaagaagag 
gatgaaatgt catcagcttc tgaggaatct 
atggagaaac tgacttggct ggcatctgaa 
gaagagaatt ctcaggagga gaactctgag 
ggaatggaaa gcctgcagaa agaggatgaa 
gagaagcctc ctacttttgc ttcacctgag 
ccaccaggag agagcatcaa agctgctgga 
aagcggggaa gtcgggctcg ggccagcaag 
gaggacattc tcgagcgaga tccactcagg 
tatctgacca gggtgcgaga agccctacaa 
caagtcatct atgaatttga gtcaagtacc 
agcctgcaaa ttctgctcca agactggcct 
ttacctgagc aagctctggc ctgtggatta 
cgcaagttcc ttcggcagct ggagatttgc 
attatcaagg tcctccaagg ctgtgcagac 
acacagatgt ggcagctcct caagggccac 
tttgaccact tgcgcccagc agctagccgg 
gaggaaaagg agtatgagtt tgatggcttt 
gaggaggagc ctcccaagat acccacagcc 
gtccaaaatc atgataagga gactgaatgg 
tgccatgaag gaggtccaga ttccaagctg 
tgtagcagca aggtctgtga cagcaaatcc 
ggcagcagcc cccaccgaga ggctagtcct 
aaggatatga tggaagagga agccccagag 
aggactgtca ggaccaccag aaagggagag 
actttgccat cccctcgaga agtgactgtt 
cctcattcac cagagactcc tcaatttccc 
aagagaaacc aggttgggcc tgaggttcgc 
aaagagaggg agggccaaaa ggcagtgagt 
gcatcagaaa ctgagaaatt gcctggtacc 
tgtttcctca aagaccagag atgcagggag 
agagagatgg ctgccctcaa gcagagctcg 
ccatgaatct ccatcaggaa ttgacacctc 
tttggctaaa gacagtggaa cacaggccaa 
cgcagaagct acggtgtgtg ccaacaacag 
cctgtggaca agggaagctg accgtgtgat 
gccacagacc ttcaacatca tctcccagca 
ccaccgtttt cgagaactca tgcagctctt 
tgaggatgat gcaaccagta ccagcaatgc 
gtctgaagag gagctggatg aatgagactc 
caggcgccct ggcaccgggg agggggtagt 
gtttacagat ctggagagca ggaggccagg 
acccaggggc tctgccatcc taggcatcat 
tcctctgtaa atagcatcga gagtggagtt 
atggcacata tttattgttc tgtggtctaa 
atgttggtgt agctagtccc gcgacattga 
catccagctg tttttattgc aaactagctc 
gaggctgtca ccaccctggt agcactgggc 
tacgtcatcg tgctccactc cagcatccat 
ggtgaggtct ctggccaagg cttccagggt 
cccaaacact gggatgctct ccactgctgt 
gatgattatt ctgccctggg actcgccaaa 
gattgtaaac tcaacaaatt tggtgtattg 
gaatgctttt cttgggagcc atagggtgaa 
aaaaaa 



gagttgagca gtgctggaga agtaacgaaa 60 
agtcagccaa ctaaaacccc ttcatcttct 120 
acagatgtga acaaaggatc mtcaaagaat 180 
cttagtagcc ccccagggaa gccagaagat 240 
actccagttg ggccagaaac tggaggagag 300 
gactttgatg acctcaccca agatgaggaa 360 
gtgctttctg tcccagaact ccaggaaacm 420 
aggcgcatga gtcaggaggg tgagtctgaa 480 
ccagaagaag aggaggaaga agaagcagaa 540 
atgacggatg aagcagttgg agactctgct 600 
actgctccag aagtggagac cagcagaact 660 
aaaggccgga acaatcatcg agctcgcaac 720 
gacacctcca agctgctgtt gctgtatgat 780 
gagcagaagg acttggcctt tgcccaagct 840 
catatccctg gcaagtatga agacttcctt 900 
cagagacgga cggctgtaga tctctacaaa 960 
cagctgttga aagactttgc tgctttcctg 1020 
tttgaggagc agcaggcttt tgagaagagc 1080 
tttgcagaga acccctcaca ccaccagaag 1140 
tgccttcccc aggagatcac cgagctcaag 1200 
gaccacctgc aggatgagtt ttctatcttc 1260 
atgggtgact ttgaagagat caattggact 132 0 
gaagaagtgg ccctgcctga tgtggaagaa 138 0 
tcaaagaaca agaggaaaaa agagatcggg 1440 
ccagatgggg ccaaggactg tgcctgctcc 1500 
aagaagagca aaaggcggag ctgtagccac 1560 
tacaagagca aggagcccca tgagttggtg 1620 
atgcctggtg ctaaggaagc tgggcagggc 1680 
gagcgggaga gcactgaggc cacccagagc 1740 
atgcctgttt caggattggc agtggggagc 1800 
acagaacggc tcctcctgga tggcccacca 1860 
cccacaactg gagctgtact gtacactgtt 1920 
tcctgcccca aggcatcccc cagacttcag 1980 
gagtcagagg ctttgatgct ggtctgggat 2040 
gtgggaaccc cctgcttcct tcctgagtcc 2100 
aagacatgtg tccgggaaac cagacactca 2160 
ggtgaagaca agagacagga cgtgccctgt 2220 
agagacttct cccaaagccc ctagaggggg 2280 
gggtccagag ggggagcagc agccaaaggc 2340 
caaggtcagc tccactgggg aaaaggttgt 2 400 
cctcaccatg tgccaggagc aaggggcaca 2460 
gctgggaaat aagacccctg ctgaggtttc 2 52 0 
ccacactgcc tgtgaagcca gctctgagga 2580 
agaccagctg tctgaccatg gggaccttct 2640 
tgggaatcat ctacacagga ccaaacccaa 2700 
tgtactctgc ttgtacagtc cttgagccca 2760 
acaaggacaa aggctggagg atggagtagg 2820 
tcaaggtctt ttatgaagac tttacagatg 2880 
cagctccttt ctctactttt ttttggtctg 2940 
tcacagtgtt tctaaatgta aaaagtgcat 3 000 
gctcctctgc atgaagacac tgggctcctg 3 060 
ctttctccca cactgggaac tttagtccac 3120 
caggctttgt agctcctgca gcagctctgc 3180 
gaagctggcc cagcgccgca agtcgagttt 3240 
ctggtgcagg gacaaagagg aacacagtgc 3300 
ggagggagag gaaacagaga cctgtagatg 3360 
ctgataagga agtccaacct tagtagactt 342 0 
tccccttagt acaccagtac tccagaggaa 3480 
taaaggaatg tttaactgtg aaaaaaaaaa 3540 

3546 
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<210> 16 

<211> 677 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Asp Pro Glu Val Arg Leu Ser Ser Pro Pro Gly Lys Pro Glu Asp 
1 5 10 15 

Ser Ser Ser Val Asp Gly Gin Ser Val Gly Thr Pro Val Gly Pro Glu 
20 25 30 

Thr Gly Gly Glu Lys Asn Gly Pro Glu Glu Glu Glu Glu Glu Asp Phe 
35 40 45 

Asp Asp Leu Thr Gin Asp Glu Glu Asp Glu Met Ser Ser Ala Ser Glu 
50 55 60 

Glu Ser Val Leu Ser Val Pro Glu Leu Gin Glu Thr Met Glu Lys Leu 
65 70 75 80 

Thr Trp Leu Ala Ser Glu Arg Arg Met Ser Gin Glu Gly Glu Ser Glu 
85 90 95 

Glu Glu Asn Ser Gin Glu Glu Asn Ser Glu Pro Glu Glu Glu Glu Glu 
100 105 110 

Glu Glu Ala Glu Gly Met Glu Ser Leu Gin Lys Glu Asp Glu Met Thr 
115 120 125 

Asp Glu Ala Val Gly Asp Ser Ala Glu Lys Pro Pro Thr Phe Ala Ser 
130 135 140 

Pro Glu Thr Ala Pro Glu Val Glu Thr Ser Arg Thr Pro Pro Gly Glu 
145 150 155 160 

Ser lie Lys Ala Ala Gly Lys Gly Arg Asn Asn His Arg Ala Arg Asn 
165 170 175 

Lys Arg Gly Ser Arg Ala Arg Ala Ser Lys Asp Thr Ser Lys Leu Leu 
180 185 190 

Leu Leu Tyr Asp Glu Asp lie Leu Glu Arg Asp Pro Leu Arg Glu Gin 
195 200 205 

Lys Asp Leu Ala Phe Ala Gin Ala Tyr Leu Thr Arg Val Arg Glu Ala 
210 215 220 

Leu Gin His lie Pro Gly Lys Tyr Glu Asp Phe Leu Gin Val lie Tyr 
225 230 235 240 

Glu Phe Glu Ser Ser Thr Gin Arg Arg Thr Ala Val Asp Leu Tyr Lys 
245 • 250 255 

Ser Leu Gin He Leu Leu Gin Asp Trp Pro Gin Leu Leu Lys Asp Phe 
260 265 270 

Ala Ala Phe Leu Leu Pro Glu Gin Ala Leu Ala Cys Gly Leu Phe Glu 
275 280 285 
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Glu Gin Gin Ala Phe Glu Lys Ser Arg Lys Phe Leu Arg Gin Leu Glu 
290 295 300 

lie Cys Phe Ala Glu Asn Pro Ser His His Gin Lys He lie Lys Val 
305 310 315 ' " 320 

Leu Gin Gly Cys Ala Asp Cys Leu Pro Gin Glu lie Thr Glu Leu Lys 
325 330 335 

Thr Gin Met Trp Gin Leu Leu Lys Gly His Asp His Leu Gin Asp Glu 
340 345 350 

Phe Ser lie Phe Phe Asp His Leu Arg Pro Ala Ala Ser A_rg Met Gly 

355 360 365 



Asp Phe Glu Glu He Asn Trp Thr Glu Glu Lys Glu Tyr Glu Phe Asp 
370 375 380 

Gly Phe Glu Glu Val Ala Leu Pro Asp Val Glu Glu Glu Glu Glu Pro 
385 390 395 400 

Pro Lys He Pro Thr Ala Ser Lys Asn Lys Arg Lys Lys Glu He Gly 
405 410 415 

Val Gin Asn His Asp Lys Glu Thr Glu Trp Pro Asp Gly Ala Lys Asp 
420 425 430 

Cys Ala Cys Ser Cys His Glu Gly Gly Pro Asp Ser Lys Leu Lys Lys 
435 440 445 

Ser Lys Arg Arg Ser Cys Ser His Cys Ser Ser Lys Val Cys Asp Ser 
450 455 460 

Lys Ser Tyr Lys Ser Lys Glu Pro His Glu Leu Val Gly Ser Ser Pro 
465 470 475 480 

His Arg Glu Ala Ser Pro Met Pro Gly Ala Lys Glu Ala Gly Gin Gly 
485 490 495 

Lys Asp Met Met Glu Glu Glu Ala Pro Glu Glu Arg Glu Ser Thr Glu 
500 505 510 

Ala Thr Gin Ser Arg Thr Val Arg Thr Thr Arg Lys Gly Glu Met Pro 
515 520 525 

Val Ser Gly Leu Ala Val Gly Ser Thr Leu Pro Ser Pro Arg Glu Val 
530 535 540 

Thr Val Thr Glu Arg Leu Leu Leu Asp Gly Pro Pro Pro His Ser Pro 
545 550 555 560 

Glu Thr Pro Gin Phe Pro Pro Thr Thr Gly Ala Val Leu Tyr Thr Val 
565 \ 570 575 

Lys Arg Asn Gin Val Gly Pro Glu Val Arg Ser Cys Pro Lys Ala Ser 
580 585 590 

Pro Arg Leu Gin Lys Glu Arg Glu Gly Gin Lys Ala Val Ser Glu Ser 
595 600 605 
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Glu Ala Leu Met Leu Val Trp Asp 
610 615 

Gly Thr Val Gly Thr Pro Cys Phe 
625 630 

Asp Gin Arg Cys Arg Glu Lys Thr 
645 

Arg Glu Met Ala Ala Leu Lys Gin 
660 

Asp Val Pro cys Pro 
675 



Ala Ser Glu Thr Glu Lys Leu Pro 
620 

Leu Pro Glu Ser Cys Phe Leu Lys 
635 640 

Cys Val Arg Glu Thr Arg His Ser 
650 655 

Ser Ser Gly Glu Asp Lys Arg Gin 
665 670 



<210> 17 

<2U> 3052 

<212> DNA 

<213> Homo sapiens 



<400> 17 

atgatgcgtc 

gccgcagcgc 

cttaagatcc 

ggtgcgtcag 

ctctcctggc 

tggctgaccg 

ggcttctggt 

acttctggta 

catcccacct 

aagaatacat 

gctgctgtat 

aaatggatca 

ttgtgctctt 

ttcagttcat 

taactgtaat 

cagttatgtt 

tgaaaagtga 

ttggggggcc 

tgcccacgag 

ctgaaacttg 

atctgtgacc 

tttatatatt 

aacggaaagg 

gcttctgtgg 

gctgtttcca 

atttcagggg 

gcgtttacat 

aaaagatctg 

gagacatcct 

gtgtgtgcat 

tgtacagcaa 

cggggtcctg 

aacagccgtt 

gagtgcttga 

taaatggcac 

gatgctctgt 

cacctctggg 



cctgcctcgg 
cgggcgaccc 
tgaagaaacg 
ggaaatcatg 
tgaaaatgac 
ggtctggttc 
cgcctatgca 
ctctgtggtc 
gagaaccatg 
ggagagcttg 
taaacccgag 
tcactgcccg 
cactatgtat 
ctcctgtgtc 
cctgttgatc 
tggcacccaa 
gaagcccaca 
cccctcactc 
acccagaaaa 
ctcacagact 
aacagggcaa 
tatagtcaca 
tgtgtggcca 
agaatacttc 
atcatgaaga 
gctcctgctg 
agaaagacgt 
tgcactgaac 
ttgaccctct 
gtgtgtcaaa 
acaagctatt 
cccgtggtta 
cctgtgcggc 
gggccttgga 
ctaggtaaga 
gtggggcagg 
aaggggaggg 



ccgctgcagt 
cgcccgggcc 
gcacaaaatg 
cagccatcag 
aactatgact 
atccgtgacg 
gacttcgtgg 
aacggggtca 
ctcaccgacc 
cagctgaagc 
cgcgcccacc 
tgggtgaaca 
atagctctgt 
cgagggcagt 
ttcctgtgcc 
atccactcca 
tgggagcgga 
ctctggatga 
ggtggcccgg 
tccagttatt 
ctggaaccta 
gatggcagag 
cacgaagaag 
gggttattac 
aaaacagtga 
accccgccac 
tttggtctcg 
agtgaaggtg 
cagcaagtct 
attgccagtg 
ttttagaaac 
ctatgaatgt 
ccttcgttgc 
actgattttt 
gcagagctgc 
ggaggcatcc 
gaccatgagg 



cgccgccgcc 
tcggatccga 
ttcaagtgat 
gacacaggct 
cttcatcgtc 
gctgcggcat 
tgactttcgt 
tctttaactg 
ctggggcagt 
ccggggaagt 
actgcagtat 
attgtgtagg 
cttcagtcca 
ggactgaatg 
ttgagggtct 
tatgcaacga 
ggctgcgatg 
atccctttgt 
agttctcagt 
tatttggggt 
cacaaaccaa 
gaagaggctc 
ccaaacgccg 
atgggttatt 
atccagtgaa 
tcagcagtgc 
attagctccg 
gcttccggca 
gtgtgtgtgc 
ttgtttaggc 
cfgacgtttca 
attgctgttg 
cctcctgctt 
tttttttgtt 
ggctcggtga 
ttactggaga 
cagcc^gccc 



gccgccgcag 
tcacatagga 
gtctagaaat 
ccgggacgtc 
ctcctcctcc 
gatctgtgct 
catgctgctg 
cttggccgtg 
acccaaagga 
catctacaag 
ttgcaaaaga 
agaaaagaat 
tgctctgatc 
cagtgatttt 
tctgtttttc 
cgagacgga.g 
ggaagggatg 
gggcttccga 
gtgaggcgtg 
ctgaaggata 
ttgcttgcag 
tcagtcccca 
tggcctcctg 
caaatcctgg 
cagggattct 
actccccgga 
atgctttgca 
cactccccgc 
gtgtctgtgc 
aatgtaacat 
gggaagaggg 
gaggacatct 
tcatttttta 
ccagccaaat 
cttgatactt 
ggcagggccc 
ctggcagggg 



gccgggagga 
cagtatgcac 
aacttgtgag 
gagcatcatc 
gaggctgacg 
gtcatgacgt 
ccttccaaag 
cttgccctgt 
aacgctacga 
tgccccaagt 
tgtattcgga 
caaagatttt 
ctttgtggat 
tcacctccga 
actttcactg 
atcgagcgat 
aagtccgtct 
tttaggcgac 
gctcatcaga 
tcaacagctc 
caagcagagt 
cctgtacaac 
cagagctggg 
gtcctgagct 
ccaagcagtc 
tcacagcagg 
ctgaagttgc 
tgccccggaa 
gtgtgcgcgc 
ttaccggctg 
gagagagccg 
cgatccaaag 
aagaaatctt 
tagcagtgta 
ggggcagccc 
agccattggg 
cgactgtgcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 
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accgcaggca gcgctccagt cgggaatggc caggatggcg ccctcttgtt ggagtttttg 2280 
gttagctttt acgttttctt ctccacccac ggcacaggtg ataaaatagg atccttggtg 2340 
cggagcttaa aattatgcca gaaagccaac agctcccctc gtggggccut gccttaaact 2400 
tgcttggttt gtacattttt tgccggacgc atcaagaagc aatctgtgac aaagtctgag 2460 
ggtcttcctt tatgcttgcc ctccacatta agagaagttg ggtctccctc ctgggaattg 2520 
tttggccttt ctgttcatct gtgaactgtt ttttgttttt aattactctg taccccatcc 2580 
gaatcagggc ttctaccact gctgatgcaa aaccacaaag ggacctacct gagccaccgt 2 640 
cctagccaag cgagcaaacc tgcagggggt ttggaagtgg acttggtccic cgcagaagcg 2700 
tgtgcgccgt tgggggaaga gctgcgtcac agccagaggg acaaagtgtg ggtgatcctg 2760 
gagacgccag tttccgagat tgttctgcat attcatttgc acattgttgt ctgggttgga 2820 
catgcgtgtg ggcttcagtg tgaggctttt aatatgtata tcctgttatc aataaaacaa 2880 
ttatccaagt ggttgaatcc tgtgagactt ggcaagtgtg tgcaaatcaa gtatacttga 2 940 
cttttcaacc tcttctttca atgtaacttt tatatgaaat aaagtaatca attaacagtt 3 000 
ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3 052 

<210> 18 
<211> 308 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Met Gin Pro Ser Gly His Arg Leu Arg Asp Val Glu His His Pro Leu 
15 10 15 

Leu Ala Glu Asn Asp Asn Tyr Asp Ser Ser Ser Ser Ser Ser Ser Glu 
20 25 30 

Ala Asp Val Ala Asp Arg Val Trp Phe He Arg Asp Gly Cys Gly Met 
35 40 45 

He Cys Ala Val Met Thr Trp Leu Leu Val Ala Tyr Ala Asp Phe Val 
50 55 60 

Val Thr Phe Val Met Leu Leu Pro Ser Lys Asp Phe Trp Tyr Ser Val 
65 70 75 80 

Val Asn Gly Val He Phe Asn Cys Leu Ala Val Leu Ala Leu Ser Ser 
85 90 95 

His Leu Arg Thr Met Leu Thr Asp Pro Gly Ala Val Pro Lys Gly Asn 
100 105 HO 

Ala Thr Lys Glu Tyr Met Glu Ser Leu Gin Leu Lys Pro Gly Glu Val 
US 120 125 

He Tyr Lys Cys Pro Lys Cys Cys Cys He Lys Pro Glu Arg Ala His 
130 135 140 

His Cys Ser He Cys Lys Arg Cys He Arg Lys Met Asp His His Cys 
145 150 155 160 

Pro Trp Val Asn Asn Cys Val Gly Glu Lys Asn Gin Arg Phe Phe Val 
165 ^170 175 

Leu Phe Thr Met Tyr He Ala Leu Ser Ser Val His Ala Leu He Leu 
180 185 190 

Cys Gly Phe Gin Phe He Ser Cys Val Arg Gly Gin Trp Thr Glu Cys 
195 200 205 
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Ser Asp Phe Ser Pro Pro lie Thr Val lie Leu Leu He Phe Leu Cys 
210 215 220 

Leu Glu Gly Leu Leu Phe Phe Thr Phe Thr Ala Val Met Phe Gly Thr 
225 230 235 240 

Gin He His Ser He Cys Asn Asp Glu Thr Glu He Glu Arg Leu Lys 
245 250 255 

Ser Glu Lys Pro Thr Trp Glu Arg Arg Leu Arg Trp Glu Gly Met Lys 
260 265 270 

Ser Val Phe Gly Gly Pro Pro Ser Leu Leu Trp Met Asn Pro Phe Val 
275 280 285 

Gly Phe Arg Phe Arg Arg Leu Pro Thr Arg Pro Arg Lys Gly Gly Pro 
290 295 300 

Glu Phe Ser Val 
305 



<210> 19 
<211> 3123 
<212> DNA 

<213> Homo sapiens 



<400> 19 

ggatgctgga 

acaattcagc 

gataacaaca 

caaggctttc 

tcatcaatca 

gggtcagcca 

gaaattccaa 

cctcctcctt 

ctaaaaataa 

gaccttttaa 

gaaaagagga 

aaaatcccca 

tcagaaaaac 

tcttctgttc 

tggtccaagg 

gaggttcata 

aaccagccag 

cagtatgaag 

aagattcgga 

gcagagaaag 

attgataaac 

gaagttaaaa 

gcaggggagg 

cacacaagtg 

gcaaggaaat 

tgtaacatgt 

gatgggaaat 

ataagtgtca 

ccaacgcaga 

aagcaacaga 

ccaaagaaga 

ggattacaga 



gcagaacaat 
aaccacctca 
gtgacattgc 
agtacccagc 
gtgtgtatga 
atggctttgg 
acacaagacc 
cggtaccaca 
ccaaaactat 
atgaagtaca 
aaaaaagcaa 
aattagaacc 
caagggaaga 
caacaacgga 
tgggaaccta 
ctaaaattaa 
agagggcgtg 
aattactggc 
aaccccgacc 
cattgaaaat 
agcctgaaga 
aaacccgacg 
tggcctcctc 
cggaagagga 
ccttaccagc 
ctccagttgt 
ttatcgatca 
gggggcaaga 
gtgtatcatc 
gaagatcaat 
agatcaaaaa 
aagggtcggc 



ggatttctct 
actcattgac 
tgaagatggt 
tacaacagaa 
aactcaaacc 
tgcagttaga 
acatgaaatt 
aactgtgatt 
ccagaatggc 
ggcaagtgag 
caagcatgac 
agaggaacaa 
accagtacta 
agtgtccact 
tccttggtgg 
cacaagaggt 
ggttcatgaa 
tgaggcaacc 
tcagagagaa 
gactcgagaa 
ggctttatcc 
accaagatct 
actctcaagt 
agagccaccg 
ttccattacg 
gaaaattgaa 
atttgtttat 
caggcttata 
tcctgaagca 
tagaactcgt 
ggagcaggtt 
ggaccgggga 



ttctctttca 
tccgccaaca 
ggccagacac 
gatcttcctc 
aaataccagt 
aactttagcc 
ctggaaaaac 
ccaaagaaga 
agggaattgt 
cacacgaaat 
tcatcaagat 
aatagaccaa 
aaagaggaag 
ggtgttaagt 
ccttgtatgg 
gcccgagaat 
aaacgggtac 
aaacaagcca 
cgtgctcagt 
gaaagaatag 
caagcaaaaa 
gtgctgaata 
actgaaattc 
cctgttaaaa 
atgcacaaag 
ca'agtgtttg 
tcaacaaagg 
atttctacac 
acatctggtt 
tctgaatcag 
gaaacagttc 
gtgcagggct 



tgcaagggat 
tccgtcagga 
catatgaagc 
cactcacaaa 
catataatca 
ccactgacta 
cttcccctcc 
ctggctcacc 
ttgagtcttc 
caaagcatga 
ctgaagagcg 
atgagagggt 
ccccagttca 
ttcaggttgg 
tttcaagtga 
atcatgtcca 
gagagtataa 
gcaatcactc 
gggatattgg 
aacagtatac 
agagtgttgc 
ctcagccaga 
ggagacatag 
tagcctggaa 
ggagcctgga 
ctcttcagaa 
gaattggtaa 
caaaccagag 
ctacaggctc 
agaaatccac 
ctcaggctac 
ctgtcagatt 



catgggaaac 
ggatgccttt 
tactttgcag 
tgggtatcca 
gtatcctaat 
ttatcattca 
acagccacca 
tgaaattaaa 
cctttgtgga 
aagcagaaaa 
caagtcacac 
tgacactgta 
gccaatacta 
cgatcttgtg 
tccccagctt 
gttttttagc 
aggtcataaa 
tgagaaacaa 
cattgcccat 
ttttatttac 
ctccaaaacc 
acagaccaat 
ccagaggcgg 
aactgcggca 
tttgcagaag 
tgctacaggg 
caaaacagaa 
aaatgaaaag 
agtagaaaag 
tgaggttgtg 
agtgaagact 
cagtgacagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 
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tccgtctccg cagcgatcga ggaaactgtg gactgagatt cctgcacaat ttcatcccag 1980 
aaactccaga cttgtagtct ccatgcaaga tttctttgtc ggcggcttga taaacagttt 2040 
ctttgttttc gattttgatt tcgccaatca tcattattgg cattttcctg cccggtttct 2100 
tcttcaagac tctgaacaat tgctttaaca gtcaaatgat tttttttttc ggtttgagct 2160 
ggatgggtac agcttaaatc atgggtccag cctaaaaacc accatttaac ttacactgat 2220 
caatttcaac atggactgtt tttggttttt tgtttttaaa taaagcatca ttaatgcaca 2280 
tctgcagggg tttgccaaac agcccaaact gtatacatta caatcattaa aagttcttat 2340 
tttttttaat attagtgccg ttatcatgga gaacagcatg acagctgtct ttggcagtct 2400 
gtcatttttc tagcattttc agaaactcat cggaaatggc ggtacctgtg tttcccttcg 2460 
aaagcctctc agtacagcac tcctgttcct ctgttaaaac tccttgttaa tccagtgatc 2520 
ttttaggcca aggaaatatt ttgtgatggt gttctgggtc catacaccag caatgaagga 2580 
gatagatttg tgtacttgtg ttttttaatc agcattaaca tgggcaggca ccctcattta 2640 
tagatgtcag gaaacattca gtgaaaaact tgtagaatgg gatgtgataa cgaggttcca 2700 
gtaatctgag cagtctaacg aggcccacct cctccaccac agaacgtggc tatgttccaa 2760 
gtgctactct cactcagcct gttgcggatc ttcatggcct caggagactt gtttctccat 2820 
gggctcttct ggactgcaca cttccaccat agcttgctgg gttgatctag atgtctgttt 2880 
gttgtatgga aattttgggg gaaaaaatcc aaaacacaaa ctgtgggttg aaatattaac 2940 
cgtctccttg gttccttggt attcaccgtg cctgatctgc acatttcatc gtggctgttt 3000 
ctgtatagcc tatactgcat tagcccaaga gattgttgct ttgtaacttc ttgcactatt 3060 
gttttggctg gatttgtatt acacacagtt ttaaaaaaaa caataaaaaa aaaaaaaaaa 3120 
aaa 3123 

<210> 20 

<211> 645 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Asp Phe Ser Phe Ser Phe Met 
1 5 

Gin Gin Pro Pro Gin Leu lie Asp 
20 

Ala Phe Asp Asn Asn Ser Asp He 
35 40 

Tyr Glu Ala Thr Leu Gin Gin Gly 
50 55 

Asp Leu Pro Pro Leu Thr Asn Gly 
65 70 

Glu Thr Gin Thr Lys Tyr Gin Ser 
85 

Ala Asn Gly Phe Gly Ala Val Arg 
100 

His Ser Glu He Pro Asn Thr Arg 
115 120 

Ser Pro Pro Gin Pro Pro Pro Pro 
130 135 

Pro Lys Lys Thr Gly Ser Pro Glu 
145 150 

He Gin Asn Gly Arg Glu Leu Phe 
165 



Gin Gly 
10 

Ser Ala 
25 

Ala Glu 



Phe Gin 



Tyr Pro 

Tyr Asn 
90 

Asn Phe 
105 

Pro His 

Pro Ser 

He Lys 

Glu Ser 
170 



He Met 

Asn He 

Asp Gly 

Tyr Pro 
60 

Ser Ser 
75 

Gin Tyr 

Ser Pro 

Glu He 

Val Pro 
140 

Leu Lys 
155 

Ser Leu 



Gly Asn 

Arg Gin 
30 

Gly Gin 
45 

Ala Thr 

He Ser 

Pro Asn 

Thr Asp 
110 

Leu Glu 
125 

Gin Thr 
He Thr 
Cys Gly 



Thr He 
15 

Glu Asp 

Thr Pro 

Thr Glu 

Val Tyr 
80 

Gly Ser 
95 

Tyr Tyr 

Lys Pro 

Val He 

Lys Thr 
160 

Asp Leu 
175 
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Leu Asn Glu Val Gin Ala Ser Glu His Thr Lys Ser Lys His Glu Ser 

180 185 190 

Arg Lys Glu Lys Arg Lys Lys Ser Asn Lys His Asp Ser Ser Arg Ser 

195 200 205 

Glu Glu Arg Lys Ser His Lys lie Pro Lys Leu Glu Pro Glu Glu Gin 
210 215 220 



Asn Arg Pro Asn Glu Arg Val Asp Thr Val Ser Glu Lys Pro Arg Glu 

225 230 235 240 

Glu Pro Val Leu Lys Glu Glu Ala Pro Val Gin Pro He Leu Ser Ser 
245 250 255 



Val Pro Thr Thr Glu Val Ser Thr Gly Val Lys Phe Gin Val Gly Asp 
260 265 270 

Leu Val Trp Ser Lys Val Gly Thr Tyr Pro Trp Trp Pro Cys Met Val 
275 280 285 



Ser Ser Asp Pro Gin Leu Glu Val His Thr Lys He Asn Thr Arg Gly 
290 295 300 

Ala Arg Glu Tyr His Val Gin Phe Phe Ser Asn Gin Pro Glu Arg Ala 
305 310 315 320 

Trp Val His Glu Lys Arg Val Arg Glu Tyr Lys Gly His Lys Gin Tyr 
325 330 335 

Glu Glu Leu Leu Ala Glu Ala Thr Lys Gin Ala Ser Asn His Ser Glu 
340 345 350 

Lys Gin Lys He Arg Lys Pro Arg Pro Gin Arg Glu Arg Ala Gin Trp 
355 360 365 

Asp He Gly He Ala His Ala Glu Lys Ala Leu Lys Met Thr Arg Glu 
370 375 380 

Glu Arg He Glu Gin Tyr Thr Phe He Tyr He Asp Lys Gin Pro Glu 
385 390 395 400 

Glu Ala Leu Ser Gin Ala Lys Lys Ser Val Ala Ser Lys Thr Glu Val 
405 410 415 

Lys Lys Thr Arg Arg Pro Arg Ser Val Leu Asn Thr Gin Pro Glu Gin 
420 425 430 

Thr Asn Ala Gly Glu Val Ala Ser Ser Leu Ser Ser Thr Glu lie Arg 
435 440 445 

Arg His Ser Gin Arg Arg His Thr Ser Ala Glu Glu Glu Glu Pro Pro 
450 455 ' 460 

Pro Val Lys He Ala Trp Lys Thr Ala Ala Ala Arg Lys Ser Leu Pro 
465 470 475 480 

Ala Ser He Thr Met His Lys Gly Ser Leu Asp Leu Gin Lys Cys Asn 
485 490 495 
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Met Ser Pro Val Val Lys He Glu Gin Val Phe Ala Leu Gin Asn Ala 
500 505 510 

Thr Gly Asp Gly Lys Phe He Asp Gin Phe Val Tyr Ser Thr Lys Gly 
515 520 525 

He Gly Asn Lys Thr Glu He Ser Val Arg Gly Gin Asp Arg Leu lie 
530 535 540 

He Ser Thr Pro Asn Gin Arg Asn Glu Lys Pro Thr Gin Ser Val Ser 
545 550 555 560 

Ser Pro Glu Ala Thr Ser Gly Ser Thr Gly Ser Val Glu Lys Lys Gin 
565 570 575 

Gin Arg Arg Ser He Arg Thr Arg Ser Glu Ser Glu Lys Ser Thr Glu 
580 585 590 

Val Val Pro Lys Lys Lys He Lys Lys Glu Gin Val Glu Thr Val Pro 
595 600 605 

Gin Ala Thr Val Lys Thr Gly Leu Gin Lys Gly Ser Ala Asp Arg Gly 
610 615 620 

Val Gin Gly Ser Val Arg Phe Ser Asp Ser Ser Val Ser Ala Ala He 
625 630 635 640 

Glu Glu Thr Val Asp 
645 



<210> 21 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<220> 

<221> misc_f eature 
<222> (2) 

<223> biotinylated phosphoaramidite residue 
<400> 21 

gntggcattc caggtgtggt ggtgcacag 

<210> 22 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<220> 

<221> mi sc__f eature 
<222> (2) 

<223> biotinylated phosphoaramidite rer iue 
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<400> 22 

cnaggctaat gaggcagcaa tggcttaaa 



29 



<210> 23 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<400> 23 

tcttcactgt atctcctttg c 21 

<210> 24 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<220> 

<221> misc_feature 
<222> (2) 

<223> biotinylated phosphoarainidite residue 



<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<220> 

<221> mis cofeature 
<222> (2) 

<223> biotinylated phosphoaramidite residue 
<400> 25 

anattcttgt gtagaagcga actgtccag 29 

<210> 26 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide « 
<220> 

<221> misc_f eature 

<222> (2) 

<223> biotinylated phosphoaramidite residue 



<400> 24 

anaactccca gaaggaagga gatggatcc 
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<400> 26 

anggcgaaga agatgagcgc ggcagtgag 29 

<210> 27 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<220> 

<22l> misc_f eature 
<222> (2) 

<223> biotinylated phosphoaramidite residue 
<400> 27 

gnaagaagtt taagggaaaa aggagtgct 29 

<210> 28 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<220> 

<221> misc_f eature 
<222> (2) 

<223> biotinylated phosphoaramidite residue 
<400> 28 

cnctttttcc tcttgfctctt tgaggctgt 29 

<210> 29 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 
<220> 

<221> misc_f eature 
<222> (2) 

<223> biotinylated phosphoaramidite residue 



<400> 29 

anggatcaga gcatggactg aagacagag 

<210> 30 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 



29 



<220> 
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<221> misc__£ eature 
<222> (2) 

<223> biotinylated phosphoaramidite residue 
<400> 30 

anatagacca aatgagaggg ttgacactg 29 



</;10> 31 
<211> 87 
<212> PRT 
<213> Homo 



sapiens 



<400> 31 

Met Glu Met Gly He Gin Val Gly 
1 5 

Glu Leu Gly Glu Leu Leu Thr Lys 
20 

Met Gly Ala Gly Trp He Gin Ala 

35 40 



Lys Arg Leu Leu lie Gin Gly Leu 
10 15 

Cys Gly Asn Glu Asp Phe Leu Lys 
25 30 

Ser Tyr Phe Asn Pro Gly Leu Thr 
45 



Leu He Gin Ser Lys Tyr Phe Leu Ser Ser Ser Trp Gly Gly Val Leu 

50 55 60 

Gly Gin Glu Lys Lys Gly Asp Lys Asn Thr He Pro Glu Lys Glu Glu 

65 70 75 80 



Lys Tyr Leu His Phe Val Leu 
85 



<210> 32 

<211> 82 

<212> PRT 

<213> Homo sapiens 



<400> 32 

Met Met Phe He Val Pro Val Val Phe His Pro Arg Ser Arg He Ser 
1 5 10 15 

Phe Ser Thr Phe Ser Phe He Arg Val Met Lys Leu Asn Pro Trp Ala 
20 25 30 



Met Ser Glu Ala Gin Ser Leu Glu Cys Val Tyr Ser Gin Trp Cys Met 

35 40 45 

Tyr He Leu Cys Leu Asp Ser Leu Arg Ser Val Ser Glu Asn Leu Asp 

50 55 60 

Ser Ser Leu Leu His Lys Asn Phe He Cys He Tyr Glu Asp Asp Ser 

65 70 75 80 



Val Pro 
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FIGURE I A 




BamHi 2772 CJaJ 266$ 



Ptosmld name: pED Sdpc? 
I*lzism*d size: $374 ftp 



Commentaffltefertncvt: P ED6dpc£ is cfctived irarn pEDGdpci by IrwsrUon of a new 
pol^inke/ lacilitate cB^A cloning. AST CDNAs are cloned beiween E<ttRI artd Noll. 
pED vector Ard described in Kaufman £l NAft 13: 4*$$4490. 
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f*CTflJS5WEI6*46 



FIGURE IB 




Plaamld »fiA: bp 



2frhnni»rrtfl/lt*f*niK.gw pNOTs 5 derivative b| p^72 [Kai>1rYi£r> nrf *|,i9afl. W^.c*||.pio|.&:t74i-i7Sftj, 
DNFR wa s del&lad u n <3 a. new paiyftik* wrb insert behwwi EcoKi and Hpal. Mis or^in 
o# r^r^icflliDn was Ine^rtocJ j, ihe Olal &I1m. SS7 cDNAa ar* donad betaettt E<*>Rt and 
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SBfrjSWEE LASTING 

WcCoy, Conn H. 
Lavaaiic, Edward R. 
Cnllin5?-Gl*ci&, I/ifca A. 

MetU&cg-, no via 
TxtKLgy, Maurice 

StiSinincroir ir, RChhetft il. 
Uenetica inatitute, inc. 

<120> StefiHJSSTBP PROTBIMS A>1d POLYWUCLEDTID29 BMCODIMG THEM 

<17G>- Patently vet. 2.0 

<2l0> 1 

<2LIt- 2i6D 

<2J.2> PJ1A 

■:21S> EtOOYto sapiens 

<220;> 

<221> unsure 
*223* [33*3 > 

P> 1 

ccactfeee.ee cttttgetaa tfeaaaa^taa gaagtcgcraa tgggaacgag gt-gcceaffefc 6D 

efcttftggaga aagcttaaatf acoQE^cgcc Sflftfrfcttticc tgccCtCCtt -eeflBnaegga 120 

aagaggagct cctatftttoa* ttaaflCCtf&v gtagggctgg ttfitcotttc cg^gectfMia lftd 

tcGcagsctffc tj^tgaAtta tflfaacgtgcn acaccatffaa gctettgtgg casgtaacfctf 2-JO 

t£caccacoa c 9 <?Ctg^aat gccatpctgc CCtffiflt togt ctacctcac* i^caagtgt J DO 

ggatt^tgtxg tgcaaecBte go fcffCt ate* cctcagcc^g ffetCc^MD tgvwc-tccg 

tctgctcflfea ^Bgtaa^cag ttta^Magg tgst^kGcac gcggcggggG et^tccgasrn 420 

t<tce^caggg tftttCCCttffj aecacccggt ae-iteaacct cat-ntfa^aac aacBtcGaga ISO 

tg^tccaggc c£?acacottg c^o-Gafcctco accacctg^a gattctgcag t.tgggcagga 54 & 

acdccBtcc« flrcagattpgag crt^ggggcct tcaacese*t ggpcftgcctc aaGaccetsj^ 60D 

Afffitgttcga c^act^g-Ct^ acagtcatCC C tagcggggr cfctCffaatdtt C-tgtccaag^ t$P 

tgcgggasct ffdosattcgc aacaaee-cca tgga*ag C at ccectcttai? gwttcaacrc 73d 
sgtftgccete octcatgcg<3 ttgsacttgg sregasyetcaB gaagctggag tatatetet* TBG 
agggagcttt t^a^gQfflGtg ttcaacctca a^tatctgaa CttCfiftf^atfl tg^aoattA 840 

Aagacatgcc caatctcacc cccatggtgg ggetgea^B uatgg*g*t* tcassrcaaac 3 0fl 

actfcc^ebga gatcaggcct ggctpgttcc atC^Cctgag ctCCCtcaafl fcd^tctggg SSQ 

tnatg flac -tc acafftftiiag,? ctflattffigc ggaa^gcttt Ctfocgggctg gcttC*Cttg 1M0 

tggaactcaa nttggeccac aataae^tct cttctttgcc ccatgacctc tttace-ec 9 c 1D9D 

tgoggta^t ggtg*JB#ttg cstctatacc aoaac<?ctte sa^tgtgat Lgtgo^attC? IXiO 

tgtggctaffC ttggtggctt C^agagt^ta cacCCaeCBa ttqcncctgc LffCffflCCfifet 120D 

ffteatg C t* C catgca<2atg cg A ggcC£Ct ucntcgtgga sgtg 0 ate E g accteeWTC 1260 

agtgctctgc ^cecttcatc atacr-scgcac c;tcg5igaccL eaaoatttct gae^tcflca l^Zfl 

tfifecragaaoL LaAgtgtcffg a Ctccccc ta tatcctc egt gaagtaBttst c-t 9 n« D tg a^eo 

gg A W.gtgct cafffleaogcc tGCGCCCace ^a^ggatCCC t 0 tD C t«^ g Q cggcacct 1440 

tgaactfctt* ccacgcgcta ttttcagaca ctewfrtgta cn Cn tgcat£r gtaoi-eABtg isdq 

tt^caggcaa C:tCCaac 9 D C tpggcctact tcaBtgtgag CRCflee trrag ettaacacc-c 
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ccwctacag ct tCttCoac acagtaacae tggagaccac p^agatat i?g -sctgasraafca l-6?0 

cAa^g^cg-aaa fftacaaguat gttactasca ea-tceactgg ttAcragccg acatatacca 1$&0 

cctctacca* yatgctcatt caGactaaoa gtgtgoc-caa gcaffgfcg-fl-ca utarccggga 1710 

CAfidcAccac tgacaagafcQ cagacgagcc tg-gateaagc CatijtaacrGac aq^aagatCR IS DO 

tcattggctg ctttfirtsrgoa gtgattgtga taaetsccgc c&tgttgatt gtettctfita LSfcp 

aacttc^taa geggDaq-cag cage (jg-agta c?agtca<?Agc cgccctfgaet jyttgagafcaa 1^20 

tCda^gtBPga cgaagacatt c^ageagcaa catccgcajK* agrcaacagoo gctcogtcc^ 1930 

gtgtatx;aeg tcaagSggc* gtagtgct^er ^AcaattM tgaccatatc aac-fcaeoaca a 040 

cctacaaate agcacatggs' cetcactg^a oagaaaaeag cctfiyflff^aac tatctgcfltcc 2L0D 

ccacagtcag cactatctet gaa.<rcttftta taatfccaffac cnatAccaag gacaagflt&c 

ag^aaactCR satatgaptc CCtC^Circaa ataocttata asatccaata ^aatgcaoac? 

aaa&acagc-a apttttgtBC agagtgggga gagacttttt CfttCrtafcatg cttatatatt 2.2B& 

aagtctaCgg gcfccrettaca a&aaacasat fcatattaa&a tttaangaca aaaa-stcua 23^} 

acivaaaaaaa aaaajsa.ftguji.a ^3 6<i 

<21L> S53 

<2ia> prt 

<213> Hojtko sapiens- 
<4QDf> 2 

Ket Lye Leu Leu Trp gin Val Thr Vcl Kia Hi?? His TJir Tfp Afsji Ala 

1 & 10 15 

He T-<9U Leu Pre? Ptua V*l Tyr Leu Thr Ala. a It; Val Trp II* Leu Cya 
2D 25 3 fl 

Ala. Alfl rla Ala Ala Ala Ma Ser ALa Gly Paro giln Asn Cya Pr* fier 
35 45 

VaL Cy* e©r Qy3 Sex: Aon Gin Phe Sar Lye- Val V&J, Cye Thr Arff JLrg 

55 

Ely Leu Bar Giu Val Pro Gin Gly lie Py* S=r As* Tin.* Arg Tyr LeU 

55 70 75 eq 

AfiSrt Leu Wet l?Xll ABtl Afin ll* Gin Met lie Sin Ala Aap T'hr Fhe Ara 

B5 9Q 95 

Hifi Leu Hia flig JL*u GLu VaJ, Leu Gin Leu GLy Axg jRjpn Sex lie ATQ 
100 105 " no 

Gin 11= dlu vaL Gly Ala Fh* Aert Gly Leu ALa 3er L*u Asn T-hr Leu 
11 5 120 125 

Glia L«u Pbi* Aap Aan Trp ^ -Thr Val lie fcrn a*r -SJ,y Ala Ptie Glia 
130 135 140 

Tyx Leu ^er Lya- Lfcu Arg Gly Leu TJffc Leu Arg Ajs»n AftTL Pro He Glu 

"5 LSO tS5 164> 

II* Pro Ser T>r Ala Ptle Aan &rg V*\ p rD Sex Leu Wet JLrg Lo 4 

165 170 173 

ASP LfiU. Gly GLu l^u Lys Ly^ Lipu Glu Tyr lie ficr dl v Gly Ala Phe 



1?0 



GHu GLy t^v Ph9 AS-il Leu Lya Tyr Leu Ain Leu n3y Hat Cye AKfl lie 
1S5 20D 20s 



WO 



hys Aap tfet Pro Asn Leg Tfir Pro Lea Val Gly Lea GQu Glu L^u Glu 

2L5 220 

Met s&r Gly Aen Hia Ph« Pro Glu lie Jurg Pr* Gly &er Ph* Hie dly 
225 250 23b 2^0 

IrfJU. Ser SQI LfiLL Lyc Lys L*U Val Kat Asn Ser Girt Val £tsr Lou 

245 £5-0 25 e, 

ILa Gla Arg Aah Al* pfcw Aop Gly Le^ ALa S«r Leu Val aiu Lau abq 

2 §S 265 ^70 

Leu JUfl Hi ft Acn Aai} Lqu Ser £=r L^ Pro His Acp Leu Ptw Thr Pro 
^ 360 

Le-u Ajjfr Tyx Leu Glu Leu £tia Leu. KiJB ILifi Asn. Fro Trp ABU Cye 

2 *° :5D0 

Afl£ C!yA Acp 11q LBU TTE- 1-t-u Al* Tr? Trp Leu Arg Ij2, w Ty r ria Pro 
3QS 310 W5 J2D 

Thr Afcii £er Thr Cyg Cye Gly Arg Cy B His Ala Pra tf«t Kla Wet Ar^ 

3 JO 325 

Gly Ar&r Tyr Lau Val Glu Val A*p Gin Ala Ser Phe Gin CyB Ser Ala 
340 345 150 

Pro Phe lie Met A*p Ala Pro Arg Asp Leu. Jttn Lie b**- sly Gly Ax a 
35B j$ 0 36£ 

*feC Ala GlU L-=rp Ly* Cy3 Artf Thrr Pro Pro Met Ser i3ar Val LyB Jt TT£> 
370 3?& 3e0 

LtfU L*U Pro Aert Gly Tbr Val Leu ear KIb Ala S*r Arg- Hi* Pro Ar-3 
^ 335 " 4 {i D 

rl* s«rc Val Leu Aca Agrp Gly Thr- l=u ft Sn Phe- S*r Hie vol l*u Lew 
405 41B ^15 

3or Aap Thr Gly Val Tyr O-br Cye Jflet Vial Thr AAil Val Ala Rly Aan 
AM 425 430 

£9r Aen Ala Ser Alo Tyr Leu A&n va.1 s^er Ttir Ale Glu l^u Aen Tbr 
^35 i^Q 4 ^ 5 

Aeci ^r s*r PH- ph« Thr Thx Thr VaL am iPhx Thr Ola He 

«0 455 ^0 

Pro Afip ttur Thr AJr* Ly* ryx Lya Pro Vol Pr^ Thx Thr Sfir 

4B5 475 4&0 

rlir Gly Tyr dim pro Ala Tyr Thr Tbr s*x Thr Thr Val Leu lie <n n 

Thx Thi Arg Val Fr& Ly 3 Pin Val Ala Pro Ala Thr; Asp Tttr Thy 

5DQ 505 510 

Aap Ly,a Met Glci Thr s©r L«u Ai-p al 4 val Met Lye Thr Thr Lys lie 
515 52.5 
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31 e Ilis Gly Cya tfba Val Ala Vai 31& VflL ten Ala Ala Ala Met Ltiu 
530 SJ5 

II 8 Val Phtt Tyr J^rg Leu Axg Ly* Arg Ki& Girt Gin Axcr Scr Th* Val 
545 550 56<I 

Thr Ala Ale Are Thy val Glu He. r] e GLtl Val Acp CUu Asp 32,© Pro 

Ala Ata Thx £dxt Ala Ala Ala Tfcr Ala Ala Pjo fiex aiy v&l Ser fliy 

5B5 5.90 

Glu Sly Ala Vol Val Leu. JPr* Thr lie Hie Asp Asn Tyjf Afcn 

6uO g0 5 

Thr Tyx Lyg Pro ALa His Ply Ala His Trp Thr <?U Aen Sar Lfitt Gly 

WO 

Aan b*x Leu Hie Pr& Thr val mur Thr lie sai gLti Pro T-yr II* il» 
sa& «0 &S5 64d 

Gin Ttur His Thr Lyfc Asp JLyg V Rl Gin tola ThlT Gill lis 



<Sll> 155B 

<212> DfeA 

<213-^ k«mk> aapi-fihe 

<40D> 3 

gcgaogctga. taaffccAata agagcgGtah ctagaficg^a tttgrcaaacc a ttaagogfca fio 
gcttcttttt aapgcctgga cttet^atte *tctafcagaa fftefcattatg Btt^catcaa isa 
tCffCfltafrtt tgSgtaCGtflf ytagaaattg tgtet tflftaH tgacoctfctp gagtSattSa 1BQ 
catggctetg atgaatacraa a?tgagcccc A a* a eta* at ^«aj,**g 55L a tttfcotatct 240 
tLqatcwec acat^tggca a-sacae-jt eg ge^tttctt accca«asfft ctttcgtgtg 3DO 
actgeatett tot^tacgit tttBMttgt stgetttee* ttttgtcttt *gtgcctafca 36Q 
ctetta^fftg ttttcttcac tg^ACtaac aaatttoagc qattgstscc tcattngcot «o 
tgtattfctgt sjteeatat^a tatatteaga ectgtatgtt cactttaaae attuttatat 4B0 
cacact^tct cctqatctac eatatggt**. at^agefagae agtgtcttat. atcaeaettft 540 
ctcetoatot accatat^gt aa. n tgttaaa actccacatt tgtctgsatc aacfoaaaatg eoo 
catggg^aca catcjotc-eet cectctc tet *tg<?tctcat Dccttcrcittc aggcptcfcta 65 D 
gcatfcfltttg tttfcuccatt Lctgata^ta ctactccat^ ct^acgattt gecatattae 720 
tafctttggaa acacteagtg atagsactcc tagoaruMitt goaaagaa^t gutac-atar-t 180 
gtatAtcr*aa ccetcagatt ctaflffc^tLgo aaaagtagee tatacttt^e tati;a.«ttaL &4D 
acedgcttfe* atagaaaaaa ataagtttafc tcac^acata tttaeatttg atcattaaata gciC- 
aiagaaoasft gggcactttt tt^gagcta^ tcatggtagt; cattagtgat ftLttctgaac 96& 
agttcttaat ttaaaacact tcaaaggaag t^aaggtcat Qfflet tft^ctg aaggaaatgc 1D2D 
tccaaaaafct ggactgeofta aoccatcagt acaataatae gctgtgtstg tattft^tata JPSO 
taaaatgosa abta£g?cat aattffflajjca tttgcattaa tcaacaaact canattgnga 1WD 
caaaacttag ttttua^gct gtettgatta aagccaaatfl ttonatgttg ptgtsaaffan 1^00 
taff6ctEtt * caaatacttt ggaaagta^t tanttgg^aft cttgtaaag^ tattecattt lAfiD 
ttatatttaa ac^cctAtas a?attttc*a ttcctcgafft ctga^cttgL gggtggaatt 1J2D 
ctaaatttgt Atcataatct gtc-ttttgtg aaacattttg aaaatatgta tatataatat 1380 
tgtatatgea aattgtgttg ttt«cttgt *ajagggaaaa accttatttL tttttatatt U4D 
fcGtgatanct Lgttttgeat otga^cagca ^t^attgaan ggcatgtgt^l ffGtfletaaaco a^OO 
ntgttgcttt btttr^tgaafl cgaaaafcaaa agtRttt^ct5* tacaaaeaaa aaaaaa^n 15 5a 

<210> -4 
<2U> 71 



RKicnnrin- ^a/o QQ^n/tn^ a 1 ti % 



*Z1Z^ FK.T 

<212> "Hcfno sapient 

fet siw pro Lye ?<hr £,ya g OT Lye A* n Acn, Fh* Leu Ser Phe- il« Pro 
15 W is 

J*** Met Ttl> Gin Aiip Lya L<5*U Ala L^u Ser Tyr Pro Glu Val Phe Cya 
2D 35 30 

Vftl Thr Ala SfiJf Ph.* JStir Bar V&L Leu Hi a Cya Val Leu fi W Us. LeU 
35 40 45 

ser leu Val Pxo it* Leu Leu Crty tr B i Pti« Phe -Tfcr- #l v Tie His Lv* 

55 $0 ' 

Pha Lys pyg Lea i^ pro- ffLn 
65 70 



<210> 5 

<2l2> DWA 

<213> tfano a^pLgne 

<4d&3- 5 

cctttjapcct gttcctctac; otcaccaggc Gfccdttttcp acatctgttH aac^atctgg ea 

otcttagggc CGCtttCaat tctftfftgttc tggaggtcca ttjaAc tgfcfcfc ttgtgtasta L2D 
gagaagaact tttctgaagt. tcgaatatat aatatatgac H9B?g4 Mtc actflttatga 190 
acefctaeogt tatt^ceatg s3fl^ gfl t gM gactsa'Ac^tt g*t tccttgis aceccaggnc 24d 
ttgggcotgfc actgetataa ttptgaactt aatttcpfcgt fftswetettg □ttggtfccat 
ggga^ggtct CCtaAcataa. tgtttatatc c^tggctca pgcatagaga cccacteatg 3S& 
gtfctWJCCBfc* agaatttggg tfc£a*gtLM CCacCtfCac* ta C a CaCa cC daflgctaatg J 30 

atfceatgeta ttgtcagttt ttgettcttt catattgtt* tgaagertagt tttt*ttgat aBfl 

acLtetttct ttctaaagtc ratgactaoa gttatggcaa Gafcattggca agacatfcfffca 5*0 

StgctCG<?sg flfttggagcgo attfattcat* tCCtggfltfcc Le C ?igg*aac COCagaagg^ *0Q 

agaaaaijggc aagastgtt* tgttgtCttt ffflafCtgtgga gttcat«att atgga^ctt 6&0 

tctgaMtat tsgca*ta 6 g ggcacatcac gefctagcattf outlet gtrac sigctctaaga 720 

aagcasagat tctLtttagg ca«ti C ag*g catfctttttc Lt^tttctta tcsecGtattt 7&Q 

CCtiatagaga tgtCRgsagff gtgpetgtta £te$etgat*p ittatfltggt gcfcgtgtttt 640 
tttgagcatt gtctacecag agtgattatc aaggtggasc ttatt«tqtt acc-cgflcaee 900 
tgcaactact t* C otc fl tgt t^ffd<rta 0 ag tggaaa<w«t. oqtattggga atttactcag 9*0 

snraorpcacoa ggwttGtta ga-aaagcagt abCgcagagiG- acttaaQ^gc tgagccttgt LOiO 

gategggtgg act*e W ttt atgfcccteac actgtcgett cotagat tt* tgacctcafft LQBQ 

ttcctcattt ctaaagtaga oacA TOff atg ga S gatflrggg agaatatgaa ctaatttgga mo 

ttaaatggat gagaatcaflfe agtatttatt tfltfaattggt gbgaahtyga ge^^tgggs 12B0 

«actgw«ac aaaasjrtgata atttatgtgg tgtagatoaa ttaaqaatas tgaaaacao^r 12 SO 

atgaaasyat^ grfiat^antt *agt 0CB a?ac atataaagg* aatggactflfl LaAaaqntng 

agffgcBaagg tgtntBffa&a tgggaattCd ggttggaaag ftg^Ct^ctga bmuggact 1J3D 

ggagtcgsga gaaretgetg* ccaaatgtgg ana t^aaftaL tttttaaaaa tfl^^t^g 14^0 

ttfffitafctag gcaagctatt. htaacccagg aeteaettto a-ttcasa-tcaa aatatttctt 1S0D 

atettccagt tsg«fla 0 gtg ttdtfi^aca ggftaaaaaas ggagat«ag a ataccatccc 

aga^aaaofaa fragaagtatt tflaattttgt tctcbagagt tgcagatata tgctgtatct 

*ttt=tatatg gtaatacatt gcaaaaoia^B tap G gt<gaag aaatgaagta aaataaaata l&BO 

tgtgtgtcLt cetet^^t gcataeagga a^acaataca tgttggtaac at^atttgc 1740 

tt^Bttgaa gaatttLtaa gg^gttatLc ntttaatctt tfttat B gee B gaattCactg laDO 

agct^aaaaa ^ataL^^tt tttttcctte tgtsgotrjtt ttttgaggat g^t^aatcct L9 & 0 

teaaaatttt tgegctggaa patattaggc eat 9 aggt^a ttttttattt att^tttttt L930 

aaggtagcag at^ttcacat tggts^ttag ggcfl«taae u M ggttg«c aag^Bagtaa 19&0 

^gttaga^at ^ggggtactg affttggaggt ^a^twc g^ggcea«2ff* 4tgg B acana 20^0 



5 



gggaata-tta tgaa-stafrca ggatttagtg ^AfegnaB^ca tacaaaaatg gacataa^G sido 
ccaggtgcgg cg^c-tcac-go scccaatiect cgcactttgg ga-ggcctagg cacat&gatc 2160 
acct-aacgcc ag^asttcga gace&gectg nccnacaccc «tttct4ct B uagtecaa^ 22£0 
aattagctgg gcgtgtggcci cgtgcctgta ccca&eta eteaggaggq tgaggcagga 22B-0 
■aactcgcttg accasacgag gtggaEfl-e cagtgcgcqg agatcec$tc actocactcfi 23J0 
agccLgggi^ acagagtgag acatc^tctc *a*aacaata aaaattaaaa aaaatggaca 
trStggtttat tca.ttggga& BAgQMMS tfltt«tfCatg agtgtaaAa.a tgtttfttaflC 2dGO 
agccaagtca ttgcaaacot ttctate^ta otmtgtgtte: tgsrtcttcafc atafcttttct 2&£0 
ttcfccracaaa btcfeucatt at n atcatag ttgtattcat ettcttgagt cagataaafca 2&3D 
atccttatat tttgacgaa* ggtaaccttc icttccaata t&aaaaaaaa -aaaaaae 2&r7 

<213s» Hcfflo s-apians 

Met Lys Tbr: Aep 116 Ala «3*r Leu Hia. Tfcur Arg Tbr Ttrp Ala Cye <Thir 

1 5 ao ib 

Ala Lma i^a Aeft Leu lie S&r Cya Vol Ala Leu Ala Trp PJte Met 

20 25 3D 

Gly Axg sor Pro Aim 11* Wet Phft !!« fier Arg Gly 5©r Sly rife Olu 



as 



45 



Tbr Hie Erttr Txp Pfte Prn Hi* Axg ILe Txp '>J*1 jp rQ Vai Hia Hia 5er 

&5 gfl 

Thf Tyr Tbr Hip glJl Oly 
65 70 



<21D> 7 

<2l2> EWA, 

<213> Homo aap-iana 

<2 2tf* 
<4DQ> ? 

■ggcgctggtg ^cLac^cgg csggcaratg^c- 
agccqffltfesjr^ ccccatgggt gf^tcooatjtg 
Lggcgotgaa sitataae&cg aaggatgaag 
gcaccgogaa flraceeggag;*. ccg^ataacB 
tfflceofccrcHt ecttaegget g^raoggtgc! 
tcattc^pat <?BgDafcfcOge attttcgtsa 
afct-ataccgtf aaeagagect tcca^-tc-Ec-t 
ittflDttttg t^c^cattaacs tLcar^cCgg 
afctat^gact gtct&atttc tatcaaaaoc 
fiftCAadtaaa tggAgat-fcCti agtgi; tttgc 
gAavjaaatga. aaaeaaacra ategctcttt 
atacatta-aa nttgtttctc attggga.n-£g 
agea.aggt4*t tgctt^gtgg acagataaaa 
acaaccngcfl cgaacgafct-t aaagptacaa 
HCBtgotgga StCtgaccCa gataataatc? 
tgcgtactgc ageattaegt acttttcgea 
etctacatcc aacattacaa gctggcqgat 



o^cggcgctC ^a-arcn^ttcc tgtcagg^tG 

caaat^tftg^ agcgg^gcpc c^gggatcga 120 

tggacggtgg gc^CGCgCQri: gGt CCiji^gg 1BQ 
qggccttcaa acagcaaegg c tgcqsag^tt 24ti 

taccftatLtt cttcatc-fltc ggtx;t>catct 3D0 
cotccavacaa -CatCCffC-^a-ii BtCgagrattg 3 SO 
gt-aataaars tttatgitcpg gat^t^ncsc: 4J2d 
aaaaptcatt tgag-ggcaac ^tdtttat^t 4 BO 
■otcgfccBtta ^fttgaaatDt. cgagatgata 
tt-aabc-cciig taaggaatgt saaecttatc 6 DO 
g^SgagCtat ta^taacagc atgtttaatg 6^0 
attcttatcc ta.tQ<3ctat<3 ^ctttgaaaa 72 D 

atgtgajnci tt cagaaaccti: cctggaggag 78 (J 

^aaacctt^t gaactggctt- Anaccagttt tJ-ao 

^Httcatajfta tgaggatttt attffttt^is 90 D 
agt-tgtattftf ttttatag^a aggaaaagt^g 

acfctLttgaa tgtcacat^c: aattaGCCC^ L020 
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tscatt '" t taacgsacsa ita^tg. tcttgagcac ratttcatgg atggga-iaaA lOid 
tttCffCffAtt ff«*-«tOff CWtWtc »t.t«*ta ct^flS-, 1 

ttgtacdgct agtaattaat cataaataca aaaacagtag taatwaact gaeattaccl i 20 O 
tuunctta tattatgaaa gcaaatcato tg= a<?gt g W tcaas^eeio tcctactcaa i 3 «d 

cctagQfcttc ffAStSJCtOfSt afcotggotag tatgteatte tgaagttggc acafcaacttt 

h«f^!!! atf "e ,tBttt srttgtttsrct tettneci™ ggatgacttc taaaaatata 13 8| > 
tfla^getat aa B1M * a tta gctatattga tcatatcaac acr.ffen.aet3 □ tg« M rtg BC Uda 
ttcgg a «gg «a t ag W c t Ctttcagtg "J 

acbtgttftct 6 «g C ttaa a t «Ctatfl« »tgttga t ga tattatatte 4«tatgLq 1560 
; ' : St^tattt t«t 9ff ag 0 ag agggattaca a*at 9 ta t ga gtataadgac l«! 

ttgctaaccfc Ctcassattc □g agM *gat gaa«aaaga G aatatctaaa ta*taca"tt 1<U 

tlZtSlltl 1TJ X *T atff * tttoBB t. BWW ; t t gttflaflgtgg t t WtBt c 1740 

tt t gtC a*go ttucttt*. t ttMMgsa cacg^tt.cc ie«o 

tCfftetffCca taagaattaa tgctcatctc t^agtggtt gggtaggtSC t& * tt agg,g 1 M0 
r^T*" Ca r Sttqrfft t g ,at gttsr oag tcaosaacoe a „ ct clctc J" 
^ tatflt4at ca «tCC C tt eaaecttata Ktiueugg gaauae^aa 1 980 

ttaarotffBL MnsHwo bgctcgo^ag « c ttp«ct tctgccteaa ct? t„„ ca 20 *D 

^i'"^ uettttu. ttcttgtgat c*g*aaatta 2loo 

««at«tc agg«g C ttt tacfceegoag tttWlfftgett tfiacttaagt 2160 
eagaaatttt tgaaageagt «cett„tt tcGtatgcat ttB-ttCoL 232D 
' ^gtcaaata ffe aa tgt gca tgcacattc* etttagtta* ggcacsaatt 23 E « 

?^" ffSttff * ct «* et " ff»«twrttt aaaaa*a t(Jt tct^taaatfc t^et^t*, 23 J 
atdaCrasrea tfcttggag** tgtacataat aaa t ta t * at acgtatft tg g ttgaajttat 24 DO 

izssiT tfc """ ta 4ataoatg « ^tESS 

tg=t«ggaa cttattctta a*g tt(; tagt taaasataat ttfctccaeat aeatgaaaat 25*0 
cSSaSn tttttmpb tgttttttt. 25B0 

b££S.« EH^^ST ffta *"»"* snwtgtttao t«gag t «« tattMccct 2fi 40 
ttaaatea4a W»*frast aatgtaeaet Mmttat atataatttt 3700 
aaaaaacaat gtaatggtta atagtagaaa tgtgccaeac ttttc^gtt 1760 
^ t t ^^*!! ttC cataatgact tttMtcit SiS 

lllttt^J: tfl5tct ff C " tSttagtcet tc*gaatatt atttattgtg 3M0 

ttctt ^ act: tSfffflftWEf ttttcgggga taeattgttg ggggag»sgg 300D 

«ottt^ !!? t ? Wwo " "^gsatgt cflctasagaa a^gttgaaffa g e «ttt B a ff 3W0 
cnccttctca tctgtggog* agatacaatt (twatwi tctgsragttt tatctgtttt 3L8D 

e^t«t» ^ 9COta 4SfttC — fc BttttoW «Wf t. 3«D 
etgttgota* ataCffaa^ag tacgg^acat tcatatagto tctgtgaaffc atsrosrogvoa 330 0 
»*«f«tt b««ttt«tt ataggettta t g CtM a*t ff wtiSti JSEJS 3360 
t"S™a^ ^r*^ ^"^^ ^gtagaaga catt ?aagffa t^catta^t 5 2 
tt^aggoact aCtttflaatt «tflMMSHC tcccaatcgn MWttltt taaeaagt^a J480 
aagctaagaa atthcat t g a aatcataagg eag t tt«g C ataaattgat aaa^ta^t llio 

^TfT * asrbfl4 * tet -tttgtaacg ttgattttat t aaat gataa t ««JItt» ^50 
gtatgtccta t ffttB at M a aatgaa^aaa attaattttg ctatattoag gtgtrttgat »u 
aaaa C aacaa tgctooagt* tt gt t gctta CattCagC a C t»a«ttl2 cSSSSJ »u 
?!!!?! CC * S ^ ( = tt -" t " «t; C at SCCt araatoggata aaatgtaatt « ttgttwi , » M0 
!S?T? tta * ttACt * W«tt aa^* tttgt tetSooS 
!" atMt6t Wtw^t MotMret -agtttggta ttaffateaag 29t0 

t!^S rr^f 8»*«HtB ^gsg^taggc tcagtgantg cggttea flC c JOao 
tgcta^tct cctgttttea tagaaccatfc a«tbq UL tattgtatga «cg tt «tt 4140 
ntBVCtM* "gtaaggtt ttflttetttg t ^atgg ff tabtttg^g ggaLgtgfla «00 

tlaaaatc^ ^^™f " taffiM " tocr cat^.ccat t^gtaaaca 

'tgtcatttg gaacttcaaa ntaaatatct t 0 t««« M 4380 
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<W3> Homo sapiens 
<JOD> B 

Mat Ala Mat Aetl Tyr A*n Ala Lye Aap Glu Val Aap Qly Gly Pro fro 
1 5 ID IS 

Cys Ala Pro Gly Qly T*r A1 * Lys Thr Arg Are p tl , Acp Asn Th x Ala 

30 25 3G 

Phe Lya GlJS Gin Arg L *U Pro Al* TOp. GLft Pr* He Lav Tto Ma GLy 
^ 5 40 4 5 

Ttir val Pre ii* phe Ph= n« ra e fiiv Phe Ile Pro 31 

&g &5 6 o 

Gly lie Gly il<? P he Val Thr s er Aen Afcn si. Arg <jL«i ILe Glu n* 
eS 70 75 9 o 

Tyx Thr ni y Thr filu Fr* ser Set Pro Cya A*n Lye eye Leu £er 
&5 90 95 

Pro A=p ttal Thr pro Cys l-h a cya Thr 11= a» phe Thr Leu Glu Lv* 
100 IDS U0 

SOT Ph* &lu dly Aan Val Mu mt TV* Tyr My Ser A S 11 Phe Tyr 

115 120 12 5 

Gin Aetl Hie Arc Tyr Val l V9 ser Arsr A E p Aap ser Gin Leu A*n 
110 3-3* 140 

Gly Abp Sex Sex Ala, Lau Leu Aan Pro Ser Lye D2lwl cya islu pro Tyjc 
145 15« 1*5 ISO 

ftar? Arg Aan Glu A«p Lya Pro 11* Ala Pro cy fl &Ly ALa lLe Rla Aan 

1£S L70 

^ phe Act * -^P Thr LHfiu DLu I.B.U pha Leu n c m y ajpti abp Ser 
1EQ 165 up 

Tyr pro lie Ffd ii B Ma Leu t^ra Lya Lya Gly 11= Ala Trp Trp Thr 
195 2»0 £0S 

Asp Lya Abel Val Lya ptwa A«p isn Pro Px& Oly CUy Asm am L«u <xlu 

21* 220 

Glu Axg Phe Ly* aly Tbr TiSr Lya Pro Val Aon Trp Mu Ly3 p ro Val 
525 2 ^ 235 2<jQ 

Tyr Bet L^b A«=p ^ eir Aep Pxo Aap Aan AAa Dly Phe Ile Aim Glu Afip 
245 250 25& 

Phe lie Val Trp Met Arg Thr Ala Ala rro Thr Phe Ai"S Ly* L=U 

360 $70 

Tyr Arg Leu lie Glu Arg Lye Aap Leu Hife Pro ^hr j^u i-co Ala 

575 265 

Gly Ar Q - Tyr S«*r Lev, Aan Val Thr Tyr Aert Tyx Prn Hie Tvr Ph^- 

33& 3K5o 



QQ^ri/irm aiti 



ACO Sly Arg Ly* Arg Wet H« Bsr Tbr lie Se* Ttp Met Sly Sly 

305 310 315 32Q 

Lya AAn fro Fh* ],©u Giy He Ala Tyx lU Ala Val Gly Sex lie Ser 

325 330 ^ 5 

PLfi Leu lily V al Val L*u Leu v*l Jig ABEl Kia L^cs Tyx Arg Aan 

Six £=rr Asn 7-hr Ala Afip He Thar lie 



*.210> 9 
^21 2> 777 
<21?> Cttft 

agaaattata 9 agtgagtst attttcettg tgtatatatt aaacacaccn ata CMaC at 6a 

tCSffCaaagHt gattacafcca iagaafcottt SgCfttaCCtt tttfaagctag tsgat-atbat 12D 

tggtptctet asratttbtat ataaatagtg aaa^tgaat tactgaaaac eat^tLaatt 180 
tttasaactc attnn^t** gtacaaaeta gbg-tgwct agctbcbogg u « M ctte 
tafce W Ltet taa-ttaaatt abteoaaapg gag^cafctfca a B c 0 cttqg A tttacac*** JOD 
tettttgtgt ttsctetbrta aaataaastg ctofltatttg tsttctccat abttt*gagt 3*0 
wictatctae at*atgttt a tagttcetflrt g 7 ttttt«^c ccaAgaaoca gaatctcatt 4*0 
cagtacatbt agtttt*ta» saoteatgaa gciMMtcct fcfl^ctafcgt gagaggcaea <3B<I 
AAStctagaa tgt-fftQfca be canaatggtg tatgtaeatt ttgtgpcttg attcaeetag 51Q 
*agtgtetca fifaaaacotgg ««gctceeb t«ta,i» c «ag aattttatat * Latttatg* 600 
agatsrattct gtacogtagt atatettttt ggg^atggac taabrfcfrtat ctgtttaact fi60 
catoLtc-tge apgatcteta hatagt^nt caaacttafra ggtgtg&wt taaatttaac 7iQ 
Ltttttt*^ aactaragAgg twabuaBt Ltaaactgct t***aaaaaa aaaaaaa 777 

*21D* ID 
<211> 2& 
POT- 

^400> 14 

Met l** Met 11* Leu Tyr Pxo so* E ie sar Ph* Ti-p Ala Trp ^ r Aa n 
I^u Ty* L=u Aea g** ^ 3or Ala ^ s ^ VoL T>T ^ ^ Ser 



20 25 30 



Aan Leu Glu val 
35 



*2JIQ> 11 
<Sll> 137B 
^21 2> DNA 

< 21 3-3" Ko«o aapiana 



<40&> 11 

ete*cjctgg C aagggcgct* oto^csctc ctot CM o B gwttggcgt tcac*tt C g C G& 

ggwttctso tacat^ctgg wcgcL^t cwtucwu ctcatcttct tcgnwttt.g lag 

acacattaba gcatttgata ^ctg^ag^^ tgatcaeaap B ato Et atag accafftfftaa iao 

tacc-ntgaat CCCCtt^ac t«c«»flti fiEtcafcr-jc BCtttettCt tf^tcatgtt 
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tctttgtgca £jta^B^tsg<? ttacac tsreg tctcaatattf cccctcttgg catatcatat 300- 
ttggag^tat ntgagtagac catirtoatgag tggccca^a ctctotgcHcc ctacaaecat 3£0 
caiaaatpcei gatat tctatf catottgtoa gaafftfaagga tggtgcaaat taactttt ta 
tettct*gca tttttfctAct aqcrtatatQ-fc catgatc-tat fftttt^tga g^fcptt-agaa 4B0 
e^Mcacac aagaatfcggt cca-ateaagt gcatgcasaa a^ccaccaaa tgaag^att 540 
CtatCcanc* aga-tcctgtc CAasagt-agc ctcttftfaatn t^atca^tta CtEtaaaaaa 6#0 
tgactCQtta ttttttaaat gtttccacat ttttgcttgt geaaa^actg ttttpatatg 65D 
ttafcactcatf ataoagattt taaatggtat tncgtataaa ttaatataaa atgattatet 720 
ctsj^tflttga caggt-tLgaa cttgcacttc ttaaggaaca gcicataatec fcctgaatgat 
gca,tt*atfca etflfattgtcc fca-gtacaten- gaagct ttt£ tfctataggaa cttetaagfle 
tcatttfcg^t ttoatfcgaaa aa^tatntaa tcataaatta gotgt^gats tca^trtgctt 900 
ctgatgaggt gaaaactfeat at-ptga-ctafl- t^aaactt caGffggt fctc ctqiatCtgCC 360 
atgtcgatca thatatatgg atacatttac aaaaataaaa agv-gggaatt ttcecttfi^D 1030 
tfcflaatatta t<?cctstata Ltgcatgaat Bagaflfattte <?c*tattfccc ateagagtaa 10SD 
taaatatact tgctttaatt c-ttaagcata agtaaac-ntg atataaaaat. atatg&tgaa HAD 
fcta&tttfta* agaatggatt taaigctatt ttaaafcartgrt ttttatttgt aacaG&ttae 12Q0 
t-fcattaaga* attggttatt atgqttact-3 ttetaatctg gbggtaaaos/ tattattaag 1260 
aatttgc^gg ta-sta*agat tttcaaaaet gouatgagaga aaattgtata accatcctsrc 13 £0 
tg-ttcctttd gtgcaataca afcaaaac-tnt gaaattaaga etcaaaiaaaa aa&auaa 1378 

*210> 12 
<:£ll> 144 

^113 > V\vn& aa&iexiK 

Ket Al* Vhe Thr Phe Ala Ala Ptbe Cyc- Tyx Ket Leil Ala Iau ken Irfiu 
TAx Ala Ala Leu lie fha phe Ala lie Txp Hie lie El* AJa Fh* Asp 

sa ?5 30 

Glfc Leu lys? Thr ASfr Tyr Ly B Aeji Pr& He Asp Gin Cya- Aon Thx 
3S 40 45 

Afttt Pro Vfcl L-9U Pro CU-g 7^r L&a lie fci* Ala Pte Phe Cy* Val 

&0 55 fi D 

Wet Phe J_ie'u eye Ala Ala Glu TrT< Le^> Thr lieu Gly L-SU Ai-n tat 
&5 "JO 7* B0 

trffU Leii Ala Tyr Kia He Trp ftrg Tyr Met Ekar Ar^ P^S Val Stet S^r 
B5 90 55 

Gly Pxo lily vea Tyr Anp Prp Thr Tte He Jtat Aeci Ala Aep lie LtfU 
1D0 1D5 

Ala Tyr Cyg Gin Lya <31u V\y TXP Cya lyes Ldw Ala Phfe Tyx Luy LSU 
H5 120 125 

Ala *ha Ph* Tyr Tyx Tyx Gly Met El* T^r Val Leu Val Ser Ser 

130 13 5 lap 



<210> 13: 
^211> 14S0 
<2J.2> IMA 

<213> «ojnO sapiexut 
<AQU± ia 
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cgaaaatgtc saeasaaagg ccaaaaattt gtacttfctcc actte.<raaaq c-cctcct ttt 50 
tG-ccctaaac ttetttwtgc caaattagat ttaatgagaa aptactattt ttaaggacct 120 
atctgtttat gt*gaat>gat tttgLt*aga g-caatQtnaa ptattattga gcagargcct l&O 
a&Agaggact GtQfeatt tt fc gcCstttaaa g^a* tcaq:a.a acffatcatflwfc t taagtqagc- 24t\ 
aaaaadgaca. agtttta-sta Gctaaq-taga ganataaatc tcaoatgoag cgctacaatt 3 DO 
fctMttatDt taagtacatt ^tacatttct acaeaccctg CffGattfataa 2&0 

ggatggt*ct tgcatatggt gaattactac t ff ttg*cagt t tec ff ca&4a atcctatttc 420 
aetegaccaa cattgtggca tstftaeoaaa tgccaaeafc* ttgtggaata gcttgcaaate 49 D 
tacaagagac «tgffttflP0t ttttc^tttt fltsttctttg tttfcttCGCC Cttetcctg* *£C> 
atcagcaggg afcreaagg*g ggt^g^ag ttatgaatta ctcet tccag tagtagctct &0D 
gaageg-nean otttaatatc a^tttttctt aaacafcfjat t ctag^taaat gtagaasrAga &£& 
ga*g*Mgag gaagCffttca cttttttaat aw.ctga.ttt agaaattc^a tgtnttatst 12$ 
cagtagttct ^ggtattsa tagcttgctt tatttctffce tttacgttga c-agcgttaaa 7B0 
•S^gygtgaa CMCtaftHre otatattttt tttttttttg fcaagctgttt C-Stflatgttt £40 
tetttggsat ttccwataa gttcagtfdaa acattctgca Cgtttftatct agtctgatgt 3&0 
acttatceat cteattacaa at&aaaa^ac acagactcrta tttgtagctc tstaatcerct 9&0 
gaatacgrga* gtaaattttc ttqtttcctg attttga^t- tgtasctata et&tttccat ilttfl 
ttttgctctt acaaatcctfc cg^gtcta*t tctgtgagcc taaetatagc ^tggafctaa 10 B0 
aatflte*ai*a\ teatttcttt agttatcojg tfcaactfctaa aactgttgta -saasrsataaa 1140 
ocagocc^tg acaggttttt gw«catgtta aagaacttca ttgttcaffeh ttcatgatta 1200 
ttgtgtaags aa^tgatg ta^atattet gtgctgtcct acaccatgtt aattacactc. 12&0 
acffaectatt tbagttosac ataacaatga tttstcccoa gt;g*cccttt. tateret ttot 1?20 
aggcaeaLtt cttgttflttg ttgttgttec a* t tcncgtt tSGattfftAt tgetttgaeak 13B0 
actgtaattt gaatcaga*- tgaaaadgst cnscgaataaa atatattttg at-attaaaaa 144 0 
aaaaaaaaaa 

<=2il> 102 
<212i>- PftT 
<:2L3> Kcwig Bapifitl* 

«4D0» 14 

Mat flln Arg Tyr JlAii Phi? HlB OVr Leu Lya <cyr II* Val JKLa Fh* TYr 

15 to a & 

ilrg Thrr Cysj Tvr ae* *rg Met 11= Ar-g K* t Val Le<u Ala avr 61V 

20 2 & 30 

&lu Lea Leu TIlt v^i Sft!f Ala Glu H e L^U Phe Gin 1«xp Thr 

3S 4 & 

AAa 11c val Aia Ti"D Gin Gin Met Pro tax Pfc© Cye Gly lie Mb Al* 
5 C 55 &Q 

Aan Ql» G lu Tht L=u v«l Gly Phi pfcl* Cye Pht La U Ova Pb« 

** « 75 ^ 8D 

Phfi Fro mu, Leu Lftu Asn &ln &L y Trp Lys GlU Gly Axg alv vol 

Hat ftapi Tyr Ser Phc 15 In 

100 



<21D> IS 

<-aoofn is 
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Qrna^aa-CC.^ afcA-Et^n-tg^ cc cang-ogag gagttgagca gtgctggaga ^toac^Ma £0 
cng-aDagtct tacingaagga agaggagagg agtcagecaa C taaaaCGtt: tteafccttct 
ca.agagc«sc<? ctgatgaagg aacctcaggg acasatetsa acaaa^gctic mtcyaaagaat 1B0 
gcttt£tcct cuaatflcatctf tflf-aatrtgagrp cLtagtagcc cdccagggaa gccagaagat 240 
ttatcaogt^r ttgatggLca gtcagLgggg aatccngttg ggGCagaaac t^gaggagag 3-<IO 
aagaatggg*? cnjgaagaag* ggaagaagag gaattCflrafeg acetcaocca agatgnggaa ISO 
gatcaaatgt cat^agcttc fcaastfaatct gtgctttctg teseogaact cca«g*aAcjn d2D 
atggagaaac t^aattggct ggw«Ctg» aggcgcatga fft^'AsJCfttffrs tgagtptgw* 480 
gaagagaafct ctcaggagga eaactctcag ctfaGaaaraap aggag^aaga agaagcaaaa 
gCflAttfflaaa £K?C Lc^otfaa Agaggatgao atgaoggatg aagcagtcgfl afractctgofc gQ& 
gagaagcctD ctacttttge ttcacctgag actgc-tcGag aatrt&vaaac sttg-saga^ct 
ccaccaggag agagcateaa agcfc-scttfg'a aan^gnogga acaatcatcg. a^ctc^^ac "?20- 
BAgcggggaa gtcgggctcg ggcc-ageaag gaqac-ctcca agGfcfltftCftt gctgtatgat 7 8G 
gaggRcattc? tcgagcgaga tccactcagg gagea-aaa*^ acttggcct r, tgcccaagcfe. ad* 
tatctcracca to^aa-caa A^cttocM catatccctg geaagtatga -ssffaethcctt pgg 
caagtcatc-t atgavatttga gtcftagtac-c cagagacgga cggctfftaga tctctacaaa 960 
agcctgcaaa ttctgcfccca agac-tgacct catfctatcga aagactttg-s bgcttbMtg 1020 
ttaeefcflaGC aafipste Ttfpc? ctgtggatta tttgaggags ngcaggcttt H-.£a£aagago 
cgcaagttcc ttcggcjagct ggagatttgc tttgcagaga aeCtfCtcaca caa<?i?*gaag JL2«D 

attatc^agg tcctccsasg ct^t^cagae tffe^teeeoc aggagateae cgagcfceaa-g 1200 

acacafifat^t afl-ca-JC^ct caagggccan gaccapcCgc aggatgagtfe fctctatsttG 1230 

tttgaocact tgcgcopagc agctagccgg atgggegact utoaagagat ea^ttggact 112D 

gaggaaaagg agtatgagst tgat^ffcttt aaavaaaptgg coctgcctga cgtg^aagaa 1J$0 

0aQflraQ^a©G Cfcetfeda^at Accmcb ? dc toaaagaec-a agaggaaaaa apagatcggg 14^0 

gtccaaaatc atgataagga gact^gaatgg ccagatgggfl eeadflgriatg tgccfcgctCC 15QB 

tgccatga«g g*ggtccaga tt^caaaet^ ftaoaa^a^t* -aaaggpggag ct^tacrecac 15fi& 

tgtagcagca asTOtekfl-boa caytaaatcc tacaagagc* aggagcccca fc.fira£ttggeg 

ftEjffiag^MUJiiiit Ccnacogaga ggat^gtCCt Btgcctggtg Ctaaaff*agc tgggeagggc Ifcgfc 

aaggat*tga tggaag^gga agccccacag ^agcg«ffafir& gcactgaggc cacccaga^c 
aggactgt^a ggaecacca^ aaaffggajja^ atgactgttt caggatcgge agtfl^^ago ibqd 
acttttfCCat CtCCttgaga agtgaotgtt acogaaeggc tCGtCCCflfiia tggDCo^cga 
cctoflttcac cagagactCC- tCiatttCCC CCGac^acttf -oae^tgtant gt^csactgct 1$2!> 
a.*gaga*B<;c aggtCgggcc fcgag^ttcirc tec tgetci-a aggcotcpcc ca©aGttca^ 1?BD 
aaaflfagaig^fl aorafl'CCaaaa ggaagtgagt gagtcag^gg etttgacget ffg-cc-tgggat 204C- 
gGBtcaflaan ctgagaaatt gqctggtAPC gtgggaaCGC CCtflfCtttct tcctgagtcc SlCil> 
tgtttcctca aagaccagag at^cagaca-a aaoacat^tg tccgggeaaic ca^acactca 216E> 
agagftgat^g Gt^ccctLcaa gta^ajjrctcg ggtgacgapfl agagacagga ^fij-to^ectgt 22 2 & 
tGatflaattt coatcaggaa ttgacaq^tc agagacttct CCCMa^ccc atagaggggg 2280 
tttggctaaa gag«gt>ggaa caca^gccaa flflrgtcea^as ggggagcagc agcoaaagfic 23 
egcagaaget aoacftflftffta , ecaacaaca^ caaggtoagc tccact^g^g aaaa^gttflt 2A D& 
^tgt^na 6 a^g^agc-tg *t?<?gfcgfcgat CCtcaCGAt^ tflfCCajyga^o aagggg^aca 2460 
gecaca3a.ee tt.ca«catca tctcccaqyca: ^ct^wat jaagaqccctg ctcacgtttc 2S%$ 
ceaccgtttt cgagaactta tgea^ctett ccocactg^c tgtgaa^cca srttctgagvgia 2^B0 
tg&g^atgat gcaaccagta Ot?ag^:atttgc agaCG5tgct>g tctijaecatg ggga.wt.tX3t 1 640 
gtctgaa.gag gagc-tggatg aatgagactc tflf^aatnat ctac^cAgga ccaaac-ccaa ZTDa 
ca-agcgccct ag-SaCC^^g aggg^gtagt tgtacti?tgc tt^tacagce ettgagenea 27^0 
gtttacagat ntggagagca gg^g-g&cagg acaagcacaa a w *tggagg atggagtagg 2flSo 
accc^ggggc totgecat^c taorgcaeeafc tcaaggtctt tt^tgaagac ttLacajeat^ 23 BQ 
tCCtCt^taft Atafi^atc^a gfigtggagtt e^gct^Cttt CtCtacttbt ttbtggLctg 2940 
atggcacata tttftttgttc t^tgnstctaa tcBcau^t^tit tctaaatgta aaaagt^cat 3<1D0 
at^ttgctgC agctaatccc ffCflfatattga gotcp^tgc atgaagacac tg^gctcctg 20tO 
catccagctg tttttaLtgi? aa fi [?tngctC Ctttctccea Gattgg^aac tttagt^cac 
gaggctgtca ccacvCGtgfit a^ca<?tijgpgc caggctttgt; ^gctcctgca gcagttitije 3.J.B0 
cat ^Le ate a h^tGcactn cagcatecat ga^gGtgscc caoc^ecgca agtcgagttt i240 
ggtgaggtct ^tggecaaca cctGcaagat ct^gtgeagg gacaa^gagg aacacagtgc 
<reeaaa*ft<rfc <2fi«ah.(ictct cnaDtgctgt gg^gggaga^ gaaaca^agn c-ctgtagatg J3f0 
gatgfl-ttatt ctgccctggg actcsccaaa ctaataa^ga agtc-caa^ct tagtagactt 

^attCtaaac tGaaeaaatt tqjyt^itattg LeccattAgt aaacca^tac tCcagaggaa 
gaatgctttt ettgggagee? a.tagggtgaa taaagcaat^ tttaaetgtg aa-aaaa^aaa 35^0 
^a*aaa 3546 
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<31t» 16 

<2T2> 

*2t3> Homo sapiens 
<«SiC i Qp- 16 

KlBt ABP PTC Gltt Vai Axg i* U S** £ Er PrD fTO Qly hyQ p ^ ^ p 

3er Ser s*r Vftl Ji* P sly Gin Sfer- vai cly Thr Pre vai Gly no Dlu 
2 * 25 30 

Thr Gly Gly Glu iyg Aan Gly Pro glu Glu GlU Glu <U* Glu J^p Pbe 
35 40 4& 

A3» As* Lea *hr cAn Ae» Glu Gly Asp Glu Jrtftt £=r *«=r A}* Ser Glu 
£0 5$ 60 

Glu Ser v*l L*u ser Vai Pro Qly Leu Gin cm 1-hr w=t siu hVB L * u 
65 ™ ?5 " ao 

TUT Tf& 1^ Ala Ser Glu Arg- Arg Met Ser GlA Olu Dly clu Ser Glu 
65 90 55 

Glu Glu Asn Ser Din Glu &lu A*a Ser Glu Pta din dlu dg Blu Glu 
10 * 1D5 HQ 

Glu Glu Ala. Glu Dly Met Glu 9cr i*u. Gin Lye Glu Asp GU U Hot Thr 
US 120 i2& 

Asp <Jly Ala Vai fti y Ser Ala -slu Lys Pra Pro Th* Ph n a! q s 8r 
130 13& L 4B 

i^P Glu Thr Ale Pro glu Vai Gin Thr AXQ- Tbr Pro Pro Gly *Zu 

ljJS 155 lfi0 

a*x Ly 3 Ala Ala «y Ly* Gly Art, ten ABU Kia Ar* Ala Arg ^ n 

170 

Ly* Arg Gly Ser Axg <U a Ar* Ala £«: hys Abo Thr Ser Lys Lck* l*u 
Ifl& 185 1^0 

l=u ^ u Tyx Aejf> &lu Aap lie Leu Gly Arg Asp Pro W Axq- Qlu Gin 
155 200 205 " 

A*J> Leu Ala Ph* Ala Gl* Ala Tyr Leu Thr Ard Vai Arg Ala 
feL0 215 220 

Gin Hia lie P7r 0 Gly Ly B Tyr olu Ae» Phe Lr^, alw v*} He Tyr 

225 a2D m 240 

Glia PtL« Glu **v SeT Tbr *i* Ar« Ar* ^ Ala vai Asp Leu Ty* i>y B 

Sen L=^ cln lla l=^ l S u Gin Aep Trp Pro Gin Leu Leu Lys A= P Ph,^ 
16D 

Ala Ala Phe Lfcu Lnu Pre Glu &ln Ala L©U Ala Cyfi lily Lou Phfl Glu 

275 260 
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Glia Gin Girt Ala Fho Glu Lye S^r Axg Ly* Phe Leu Ax* Gin L/r^ Glu 
290 235 3D o 

lie Cya Phe Ala Glu A#n Pro Ser Wia ai^ c} n n<= Ly= ^ 

305 JiD 315 320 

L&U Gltl Gly C!yc Ala Asp Cya Leu Pra Gin Glu II a TOr Slu Lao Lys 

32* 130 335 

Thr G1H 3tet Trp E?}n Lau Leu Lys. Gly Hi B Asp «ic Leu Cln. ftBp Glu 
34& 345 350 

Fhe Sex Pie Fb* Phe Aep Hia Leu Arg Pro Ala Ala Bwr Arg Met Gly 
•355 35* 36£ 

Aap Ptv- Glu Clu He AjSft Txp Tbr Glu Glu Lye dQhjb Tyr Glu Pfca Asts 
3 7 » 375 3fe^ 

<Uy Pb= Bin Glu Yal Ala hen Ftq Aap Val C31u Glu Cttu Glu Glu Pro 
3" 335 400 

Pro Lye *i D Pro Thr ALa s*r j^yp Abxl Lye ixq i.ys Xy 3 Gl\2 Ilfi Gly 
4<15 4LQ 41S 

Val Gin Afin His Aap Lys Olu Thx <?lu 'Irp Pro Asp Sly Ala Lya Acp 
420 425 a 3 a 

Cya Ala Cyc B*x C/S HiB Glu Sly Qly p r * Afip Ser Lys? L^u Lya Hyft 

440 

sar Lye Ar$ Argi ssi Cya S«? Hi* cya ser Set Lys Val a?f Ser 
"0 45E 4^ 

Lye Sex Tyr I^ya. s Q r Ly* Glu Pro hi* pl^i Leu Val dly Bar s*r Fro 
4&5 47B 475 ~ 4ao 

Hie Ar$ Glu Ala Ser Pro Met Pre Gly Ala Lya Glu Ala Gly cUn Gly 

a35 

Lya Aap Hat Met Blij <?lu Glut Ala Fro Glu Glu Ar* Glu Say Thx Glu 

505 5-1 p 

Ala. Thr Gin S=r Ar-g Thr Val Ar^ Thr Tbr Arg Lye Gly civ Met Pro 
515 5JQ 

Val Sar <3ly Irtsu Ala val Gly Ser Thr Leu Pro S6r fro jurg Glu val 

Thr Val T)t* dlu Arg L^u Lau Aflp Gly Pra Prp prt> FCia ser Pro 
545 *5D 

GlLi Thr Pro Gin Phfe Pro Pro Thr Thr Gly Ala v^l L«u gyr -Thr Val 
565 p 570 5T5 

Ly* Arg A971 Gin, Val Ctty Pro Glu Val Arg s^ r Cye Pre. Lya Ala Ser 
5BQ 

Pro ATG Leu Gin Lyg Glu Aror Glu C3ly Gin Lye Ala Val s*r Glw Sea- 
595 «00 &0& 



]4 



Alu Ala Lpu. Met Leu tol Txp ftpp Ala Sar 



Gly Thr Vai ^ly Tfiif P^ro Cya Ffce l^U Pro 

Acp bLxi Arg eye Arr: • . l-u x.ya Thr Cya Vsl 

Axg Met Ala Ala t<*v Lye Gin Ser aec 
&6B S65 

A*E> VaL Pro Cya- Pro 
&75 



^210> 17 

<211> 

atgatactftc ccLgcctcgg C^fifotgcagt <zgc<?gcc$cc geeeecGrcag g<?cgg^tt£r«n 6€ 
JCC^cagcgO cgffSCgaetc cgccqgggcc tcgflrfttetna tcacatagg* cagtfitflcac 120 
ottangntcc taaag-aaacg gcacaaaafca Ltcugtg&t gtttagaaat ftacttgtgag 18 D 
srsttfGfltcitf reaaattatg caoerfcatcag g M Rcaggct cctfy&ac^tc gagcatcate 24 D 
ctetectg^o t^a*aat#fle aactnt^gact cttaafcctfti; ctcctcctw ■gagfl'fit^ftcg 300 
tggctgaccg ffQtCt^gttc jatcc«t«acg tfCHjrtsjgcat gatCtgCffCt 0 taatgacgt 3«B 
ggcttcfcrot c^cctat^cA sactte^t^g tgac ttfccgt catgctgetg cc-ttccaaas? 42D 
acttct^ta ctqtgtffdtt: Aacggggt*:a t-Ctt taactff £:t tgflccgtg ctCg-CGCtjJt 46 & 
Citcccfiwt gagaBLCCat^ et^R^cgacC Ctfftffcfrfcagt aec«aagga aae&ttangA &4t> 
a*,gaatacat flflfasjrogcttg cagctgaagc ocggggaagt catctacaatf tgccccoagt frOtf 
SCtfiT^t^tAt taaag^cgag c^gcKMD a,CtgCA£tat ttttCAaaaga tgtattCSfla B60 
Aaat^gatca t^acCgcc^ tgggtgaaca attgfcgfcaflu agaaaa,goat caaafffctLtt 72$ 
ttgtget-Ctt: cact&tgfcat wtagctetB* cttcagtKa t-gctct^atc ethtgtggafc. 
ttcagt&cac ctcD&gtgtc caatfargcagt ggoctgaatg cajyttfatttt tcaccfcccga sao 
tftActgtaaL cfftgtt^ate tti-ctgtgcc tfcgasrgattt tctgtttttc actctcattg 9DQ 
cngttatgtt tgetatccsaa fctccact cce tat^caacga cgagacsrgafl- afcc-gagngat. 9&0 
tgaaaagtga ^aagai^a^a t^a^cgga gggt-gcgatg acaa^gaLg aogcccgtct 1D2D 
ttffdggg^Ec; scQctcacfce r-tctggatga atcccttt^t gggat-Scega tttasr^i^o 1D$J> 
tgcp^^ag acCCa&a&aa, gigtegcccflrfl agttctc-agt gtgaggcxatL0 jjcteatcaga 114o 
ctgaaacfct* ctonoogact tceatfttatt tatt-tggggt cfctfaosgata tcaacagctc 12 DO 
atcfcij-tpacc aac-aggsrcaa ctggoaccta cacaaaccaa ttg^ttgcag caacca^ogt 12 £0 
tttata.ta.fct tafcagtcaca. gafcggcBflaa 9&agaggctG tcafftxjccca catgtaeaa.c 23£a 
aaccgaaaesr t-ptgtggcca cao$*agaag waaacgcc^ tflfOccLnctg vv.g&gvtqv& 13EQ 
Sr^tfcctgtgg agaatacttc flggtCattM attfflfattAtt oaaftt^Ctfl^ gtCCftyaget 1,^0 
gctgtttcca ateatgaaga aaaaca^tfla atcoagtgaa cag^gatfccL ceaagc^gtc 15D0 
aLttca^g^g gct<?ctgct« atcccgccpc tcagcaattfe actccccgga tcacagca^ 15&0 
gcgttt*i-at agaaatfafigt tttg^tctcg attagctcc^ *tgcdttgca ^ttpaagLtgp 1$20 
aaaasatatg tij-cactg^ac agtgaa^gtg gctt^cggca ^actcoccgc tgcc-ccggaa 16B0 
g^a^atcct tt^accctcb GHgg fl agtct fftffttft^tge gtgtctgtgc ^t^t^egcge 17^0 
gtgtgtgcat fltatgtca™ atbgccacrcg ttgtitt*ggc aatgtaiort tta^ggct^ 1B0D 
tsjtacag-ca* ac^agctatt ttttag^a^c c^acctttea gggaagaggg gacacacrccg 1B*i> 
cggggtcctB cocffH^tta ctatgaatat attg^tgt^tg gaggacatct cgatecnaog 1&Sd 
aacBQfcc^rt cctgtgcgec crcttcgttgi? cctcctcctt tcatttttta aagaaatctl 19SD 
gngtgpttgft ggffcettfrga actsattttt tdtfctttgt-t: CCfligccaaat tagcagtgtA 204C^ 
taaat^gcBtf ctaggtaaga g<5&aa^otgc gg^tcgst^a eiit^atactt ggggc^gccc 210& 
O-at^frtctgt gcg-gggca^g Grgagig^atcsc ttactpgaga g^cagggccc a^ecattggg 215{> 
oBwt^t-jgff aaff^flrgaggg gaccaLaafiTG cagceagccc ctffflcfafrg w cgar.tgtgcc 323-D 



GlU -Thr 61u L^g 2_^ u p rD 

GlU Ser £^y= Fti* Leu L/A- 

Arc Glu Arg KiJS Sfcr 

&5S 

Gly Olii Asp L.ya Arff Gin 
67 0 
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ate^eiflrsca scgctccagt L-ug.qaat.nr5c cragsatsgcg cc^tctcgtL uga-gtntttg 
jyttagatttt acspttttcte ^tcqacccac racaoagg^g ataaaaeaug atcoct-ggts 2340 
cssragcttaa aatfcatgcca gaaaffccnac agctcccctc Qtupggcctt gccttaaact 
ttfctto-atiK. gtacaetttt tsac-tggacgn atcaasja&tfc aatctgtgac aaafftct^g- 2d*D 
Ogfcottcett tatgctt^ct otMMlCta &ff4$aagt:tg g^tctccctc ctgggaattg 
ttfcggcctdt GtfltttBtct gtgaactstt ttLtgttctC adttactcLg taccccatcc 2 53D 
gaatea^sjpe ttotaccact gctg-atg^a* aaccacaaag ggacctaccfi aasreeoacgt 2fc<iD 
cotagrcaag cgagcaaaee tgnaggg^gt tt ^ ffaagtg ^ acfctacreac cq C *g*ag C g r?0l> 
tgtgcffCGfft tffOjygg^aga gcCgCfffctoc agccagaggg Aoaaagtgtg ggtcatCCtfia 27 6* 
eagracaocag tt.bpcg*gat ig-ttctgcat attcatctgc «c*ttgttgt <rt$ggttgga 28 SO 
cetgcgt-gtg gg-cttcatftg tgaggcdtit aatBtgtata tcet^ttatc aataaaacaa: 
tCatccaaCft Gfrttgaatsc tfftflagactt gflcaafftflfCff Lgcaaatcaa fftatAtttga 2$4Q 
cttttcwDD tcttCtttCft atgtaagttt tatatgaaat aaagtaatta attaaeagtt 3 0 l>D 
«?tcaa*a*aa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaanaaaaaa aa 3052 

<21G> IB. 
*212> PRT 

<2L3j> how sapiene 

*4VG> IB 

Met Din Fro SBX Gly His Ar;g Leu A*g Asp V nl Glu His Hia Leu 
1 S 10 L5 

Leu Ala Blu Am ^ Aon Tyr Asp Sir 3«r s gr 3er Sex Bear s* r Glu. 
20 2& 3C? 

Ala Asp val Ala Aap Arty Vol Trp PH* lie Axg Aep Gly Cy* aly Mot 
35 *D AS 

lie Cyc JQb Val Met Thr *xp L&U Le\l Val ALa'tfyr Ala Afc£ *hr v*} 

&S & 0 

Val Thr Ptie If ml HBt Lfill Pro Sar Ly* Aep ebe Trp. Tyr Ser Val 

6& 70 75 80 

Val Aen Gly Val He pfce A*n £ya Leu Ala Val Ala L?u Se r Ser 

as J5 

fiia Lea ArQ Thr H&t Leu Thr Asp Pro Gly Ala Ua l p ro Gly 
1O0 1Q5 LID 

AlB TOr Ujfti Glu Tyr Met G>lu Bar Lem Gin Leu Lys pro Gly aiu Val 
1J -* 1S& 1S5 

Ila Tyx" Lyes Cy* -pro Lye Cye Cy* Cya lie Lya F^o Glu Arcr Ala Kia 
I3D 135 140 

Hi* C** Ser 11* cya i^fc Axg He Axg Lya Jj et His Mis Cy* 

L ^ 150 1SS ^ 

Pro *xp Val AKn ASM Cy* Val Gly <31 U L-y, Agn Gill Aru Phc V al 
16& 170 1.75 

Leu JThr Met Tyx rle Ala Leu fc** B er v*l His Ala Leu Tie Leu 
19D 1S5 19D 

Cye Gly Pile Qln yhe He S«r Cy^ Val Ar* <fly Txp Thr Glu Cya 

2f>0 2D5 
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Sn* ftprp rte Ser Pro Pjro He Ttur Vfll lie L*U Leu He Pha Veu Cya 
2LO 21J5 220 

Leu DLu aly Ley. L*U Phe. Ph* Tbr Phn 7'hr Ala VaL Met Phe '&Ly Ttir 
2*5 230 235 240 

GLa Hi? Ki* Ser lie Cye AAA A=p alu Thr Glu Ilfi Glu krg Leu kyg 
2-35 250 255 

3*r Gly Lys Pro Thr Trp aiu Aarg Arg L-9U A*g Tip Glu GQy Net Lye 
2€0 265 27C' 

Sor VaL r-ha -Gly Gly Pro Pro ser Lau Lsu Tstp Mat Aan Pro phe Val 
275 £aa 265 

gly Piie Arg Ph.* A:nj krg Pro TAr ASfrfl Pro Arg L.ya Gly Gly P*e> 

295 3i>0 

Cl-U Phe Sar Val 
305 



<212> M 

*£l3> HftttO capicasa 

ggatgctgga aeagaaaaat ggatttctct ttctctttca tg^aagggat c^t^^aaac £{) 
acaatttagc aacc?a.c<3tca actcattflac tccgccaaca teegfcera^a ggatgccttt L2D 
gataacaa^a -gtgacattqfii tgaagal-ggt ggCCagaCflC cataLgaagt? tactttgcafc L8D 

saaggctttc aflfcaicaagn tweaacagaa era tc ttcatc cactcacaaa t^gctatcca, 2 40 
tcatcaate* gtgtgtfttg^ aac teaaa-cc aaatac-cagt catataattfa gtatcc:ta.a.t 300 
^teagece atggcettsra tgcagtt^ga aactttagce ccaebaacta ttatc&ttca 3SD 
gaaafct^caup. acacaa^cc acatgaaatt cborgaaaaaa ottcag^tcc acaffccatca 42B 
CCdCTCtCCtt irfigtaccac* aa.£t0fctfAfct ccaaagaaga ctggctcacc tffaaattaea 480 
ctaaaaafcaa crcaaaactat teagaatggc agggaatt^t tfcflfafffc<rttc cctttgtgga 54B 
gacpttttaa atgaatftaca ggsaagtgag caeaecraaat. oaaagcatga aagcac/aaiia £00- 
gaaaaga^ga aaaaaag^sa caagcatcac taatcaagj&t ctg-aagagcg taagtc-acac- 660 
a-aaatcfccca aattasaactf agagyaac-aa- natagacoaa aeffa^a^gt tgacaetgCa 7 20 
Lcacnacoftnc caa^fia^aaga at^agtacta aaasacraaag Dcecagttca gecaataerta 7 BO 
tCttCtgttC Ciacaacgga ag tgtctat t ggt-pttaagt t tcagetta*? ^tfatntfcgtg 840 
CiSflrtlfiiaagg tgggaaccta tticttggtgg CSttgtatgg ttt eaa^Lga tc^rca-gctt 9 DO 
gnggttcata. Ctaaaattaa CfliCa*ga.ffgt geccof-aflfaat tttMtgt9C* Sttttttft^c 9£Q 
*accai?G^aflf a^ogggugtg g^ttcat^aa fiaaegggtae gagagtatBa a^taaLaaa 1020 
caflrtatga-ag aattactsrac tgaggcaacc: aaiacstagcca e^aattoctG tgAgaa^caa 1DBD 
aogattcgga aacee^acc- tcagagagaa cflffc<jcfcoogt gggat^ttgg catt^cccat 13,4^ 
gcasacaaatf tattga*aat ffacte-gagaa gaaagaatag aacagtatat ttttattt^c 
attflaboaac agectgaa^a ggDtttAt<;c caagcaaaaa dgagtgttgc ctccaaaaet 12 ea 
gftogttaaAa aaaecc^acg accaagatct ^tgctgaata ctcagccaga acaflanoBac 1320 
gCBflflorCjaSg tggwtCGtC afitctcaagt a^tgaaattc ggaffae^tag eoagaggegg 23BQ 
cacacaagtg cgeaa ? agga agagccaccg iCfcffttaaaa tagoqtggaa aactfftSfgca 
gcaaggaaat etttAccagc tttfiattac-g a^g^£?s*a^ ^gagect^a tttg^g*a^ L5D0 
tgtaaoatgi; ctccagttgt gAsanttgaa caagtattt^ ctnttc«gaa ts<?ta«a^ L56Q 
gatgggaaat ttatagat^a Rtttsrtttat tciacaaagg gaattfl^taa caaaacagaa L6^0 
abaagtgtc-a ggggflfcaasga caggi?tt^ta atbtctaca<: caaaccagag aaatgaaafls L6B0 
np*acg<;aga Qtflftatc^tC? tCGfcaaagia atattztggtt ^tftC^g^CtG afltaganaag 
aa^caaca^a gaagatcaat Hft^aaatcgt. tctgaatcac aaaaatocac tgaggttgt^ 1B&0 
ccaugafigfl agatcaaaaa ggagca£PGtt fl-aadta^tta etcaggx-t^C a^t^aaciact ie$0 
ggattax:aga aagggtogg^ Gffscc^ggga gtgpagggcfc ct^tcagatt caqtgacagc: 1&20 
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tc-cgtctcco, Ca^CQ-UtCflr^ ggaaectgtg gactgagatL ctt 9 tacaat ttctftcccaft L9B0 

aa*ctccaga ctfc&tag-tct cc^tgc^asa LtLetLttftc ggcgg^ttg* taaac-afrttt. 2040 

CtttffCtttc gatttSgett t-C^C-caatta Lca.tfc.at tgg ca£tttcet-£! LCt^gt-t tc-t ^ 1 00 
LtiLtcaag&c tctgascaat t&ttttaaca gtcaaaCgat tt t tt ttc ggtttgngct 2lG0 

ggafcgggtac agcctaaatc? atgggtecag cctaaaaaec aDcatt.ta.ac ttacactffafc 2220 

caafcttcaae atggactgtt tttggttttt t fl tttttaaa taaagc^tca ttsatgcaca 2280 
fcfcto.cagggg tttgccaaae dftcccaanct gtatacatta tfaattattca aagttcttat 2 340 

tttttttaat attagtgccg ttatcatgga cra&ea&catg acag^t-gtct ttggcatftct 2 400 

gtcatfctfctc tagc^ttttc a-aaaacbtat cggcaatgge ggtacctgna tttc-cDttcg 2460 

aaagcoteti? agtacascaer tcotgttCQt Ctffttaaaacr LccttgLtaa tCC&StSatC 2 52D 

ttttag-gcca aatraiiatact ttgtgat^gt &t.t6t^ggtc catucacMg caat$aagga 2£80 

aataaatttg tgtactCgtg ttttttaatc agcftttaaca t&a(AC'ia%Q>-n c-cctsa ttta 

tagatgt<?*g gaaacatttn gtgaa^aact tgftaffaatsg gfitgt.ga.tA?. cgag^fchccfc 27 0G 

gtaatc&gap eagtctj&acg aggccca-zee cctccaouae aga^aogl:^,: titpttccao 2760 

gtCjctactet cactca-rjCCt ^C^cggatC ttcat«5gcct cafl^agactt gt/tfcctccat £3^0 

gggctcttct BGrac-tsrcaca cttcc*ccat aarett$*tgg gtt gate tag atgtctffett 2SBQ 

gttgtatacrfl tattttgggg gaaaaaatcc aaoacacaaa c tgtg^attii aaatattaa-c 2940 

cgfcctccttg gtt&Cttgtft fittoaocgtg COtgaLcfc.Q'C fltaLttcatc g tggctgtt fc JO DO 

ctgt*tagcrc t,ata£t$cat tagcccaaga eattgttqct tt-gtaactti: ttgca-atat t 3Q£Q 

StttLgflfCtfl 9atttgtatt ac*cac&£tt ttaaaaaaaa caataaaaa-4 Baaaaaaas* 31-20 



*210> 20 
<211> 64S 

H&t Asp Pfce Sftx Ph* Sw: Ptue Met Gin Gly II* Met Gly Aan Thr ILe- 
1 $ 10 L5 

Cltl GlJl Pro Pro Olr. Li?u II© Aep Stf Ala Ann U« Ar$ Gin GlU AA-p 

20 25 3D 

Ala PhB As» Afln Ann Ser Asp II© Ala Glu Aap Sly Qly Gin Tfir Pr& 
35 4tf 45 

Tyr <?lu Ala Thr Lfcu. &Lci Din- Gly PJ*e Girt Tyr Fxo Ala Thr TfrlT GlU 

«0 &5 60 

Asp I^vi Pre? PXG- Iniu Thr Am Gly Tyr Pro 9er Scr q;!* Ser VaL Ts-^ 
*5 70 75 9D 

tUu Thr oln Thr "byfc ^?yx giln Sei l^r Asn «3ln o^r p-^ro aeci Gly Set 
S5 30 95 

Ala. /»9n Gly Phe <31y Ma VaL Aro Aan Phe Sctx prg ^hr Tleu Tyr Tyx 
l&D 1U5 1L0 

Hi^ G-Lu lie Pro Apn Thr Arfl Pro His U^y 21 e Leu Giu Lye Pro 

12D 125 

Sar Pro Pre Gin Pro Pro Pro Pro Prp ser val Pr* Gin T>ur vai He 

Pxo Lys Ly* Thr Gly £er Pxo Clu lie Lye Lt=u Jjyi lie Thx LyB CThr 

160 



145 1&0 1^5 



JL& ^lix Asn Gly ,^fir &Lut l^g Phe Glt» Ser Scr La\i cy* Gly Aap lAu 

170 L75 
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Leu Aftn. Rlu V*l Gin ALa a*r Glu hIb Thaf Lye Scdt kyg His Glu ficr 
1BO 185 ~* 19,3 

Ar* bye calu ^ yB ^ JLy C Lya g^r Aen Lys Hi= ji ap 3 * E £er 

20D 

Glu Ollr, Axg LyE Hi* Ly* H© p XO Ly* Leu Olu Pro Gin GLLL Olx. 
*« 215 230 

Abxi Axg Pro Aen Glu *r.g Y-al Rep TOs* Val S*r Glu LV3 PrfiP Arn Glu 

235 M0 

<Zlt Pre? Val Lea Lys Diu cki Ala Val pa n Pro II* Leu fieur B«r 

3« 250 2 3 S- 

Vnl i-ro Thr TOr Slu v*i ser ^ Gly van hys Ptle GLn V al qly ABp 
2&Q 270 

L*U Val Trp S«ar Lyn v*l Qly Tti* Tyr Fr-p Trj> Pro Cy S Uot Val 

275 355 

Ser Aap Pre Gin Glu VaL His Ttvr Lya lie h*n -Thr jtrg- Ply 
230 Z9$ 3D0 

Al* Ar^ 01^ Tyy Hi* val Gin pu* Kh e So3 . Asn GLft PM QIu ^ &la 
31D J15 

Trp Val K** Glu Ly* Axg Val Arff &Lu -pyr Lye <3ly Kis LyA Oln Ty^r 
32& 3*0 33& 

aiu dla Linu teu A ia qi u Thr ^ GLn ^ Ser Aen Sex ^ 

3J5 35<> 

Lya Gin. Ly= il,* Arg- Lys *» Atg Pro GLn Jixg Glu Arg Ala Gift Trp 

350 365 

ABJ> lie <31y II* ALa Hia Ala (Jly LyB Ala W I^y? Met Tftr A*ff OLu 
270 275 

GLu Ar^ He aJy Gin -Tyr Thx ph« lie Tyx SLa a 3P bye Gin &lu 

3&5 ws 

Glu Ala L«=^ ficrr Gin AlA Ly^ L y* Ser VaL Ala » ftr LyB Tto G1U Val 
«5 415 

LyB Ly* Thr: Arg Ar^ Pr^ iijrg Ser Val Leu Ajjxi Thr Gin Pe* Gin Qln 

TUt Ann. ALa Ply Gltt Val JUa Ser S^r L«u <2*r Ser Thr Qlu tie Arg 
^35 ^I4Q ^45 

Axg Hi-s S^r ULft Arg jixg ftia Thr fl<ar Ala Glu Qlu OLu Pro Pro 

* &0 455 , W0 

K;o Val Lye lie Ala g^p t# fi ^ Al5 ^ Ala Arg Dyg ^ r Leu pj>j> 
i65 «0 475 tiBD 

Ala S« tie ^ Met Hi* Ly a ^iy Ser Lr^ A^p L9U Gin Ly S CV« Ah, 
4a - ^90 4&5 
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Jfet Bar rrp Val Val Ly B He Gly ©l n VaL P*e ALa Gin A*n Ala 

5C"D 505 

Thr Qly Gly kye *h= lie A*p GUl Phe Val Tyr Sfr Ttur L,ya Gly 

515 &2d 

ri* eiy Asn iryB Flir Glu IMe val Arg Gly Girt A*p Ar 9 L*u tie 
530 S35 5dO 

IU Thr Pro Aew Gin Arg Aaj, Glu Lye Pjre- Thr Gin val Ser 

5-3D 5&5 5*0 

3*r Pro Glu Ala Thr But Gly SfeS? TtUf Gly Sfer Val Qlu l.ya Lya Gin 

&7D 575 

■Gin AJff A*ff Stir He Arg Thr Ar? Set 1 Glu Ser Qlu V/* S©r Iftr &lu 
SBQ 5S0 

Va.1 Val Pro Lyfc l/y= L.yg Ho hys Iiya Glu Gin Val piu rhr Val hro 
595 6Dfl $05 

Gin Ala Tftr Val lys Tl>r Gly Leu Glal liys- Gly Bar Ala A3P Ar£? Gly 
?W 615 620 

Pin Sly Ser Val Arg Fhg AfSD fier £ex Val S**: Ala Ala lie 

530 £35. 

aiu qrly Thr Val Afij? 

645 



<2ll> 29 

<212i> 

<213;> Artificial Sequence 
<22Q> 

-^232> olis&nu-Sle&Lid* 
<220> 

^.22 1> Tnlsc_feafurc 

•tf223> binticyylatafi ^ocplioaxaittidi te residue 
<400> 21 

Ottt$gcBtt* c*ggfrgtGsjfc ggtgcAPAg 

<21tf> 22 
<211> 29 
^212> r*?A 

*213> Artificial Sequence 
<2£0> 

<223^ -pli-gonucltotiite 
<23.Q> 

c221> miacfeaturs 
^222* (3) 

^22 3? i?iotinylfiLed ph^gptioaramiaite; reside 
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<10U> 22 
<21ti> 2 3 

<2%i> aa 

<2L3> Artificial ^ecjuanci! 

-<223> olig-Dnvicl&Otidje 

tcttwctst atctcetttcr e 
<21D> 2-1 

Artificial S^Quencfe 

■c220> 

^2£1> miEc.featwe 

*;232> (J> 

<4D0> 2<J 

*riAn<?tC<?Ca gaaffaaupg-fl. gatggatcc 

<21D> 2S 
*2JU> 29 
<212> JDWA 

■^"21 3 > Aartifieia.1 Segu-fittCe 
<220> 

<-221* boiScu.f eat^r* 
<222> (25 

*n*ttcttg"t ffta^ELagcga act arte can 

<311> 2? 
<212> DM*. 

<213> Artificial 9«gu*Eic© 
<22D> 

<22t» 
*223* <2) 

*2 23* hiotinylatec! P^oepboeiraaiidl residue 
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<400> 26 

■c2ll> 2 2 

<212> £>MA 

<2t3> Artificial Sequence 

-<220> 

<22£> oligonucleotide 

<220> 

<22l> iniBt_J!*ahLLE-(!! 

<22z^ raj 

gngusgaagtt taacaEfcAaa aggogt^gct 29 
<21 S> DWA 

<21J> Artificial Sequence 
*22D> 

<223> Dliganuc Peptide 
si22> f2) 

<223> Mot inflated pbuMaphparamiaite residue 
<*4QI» 23 

fynp-ttttt^C! tcttgttett tgaggc-fcgS. J 3 

29 

<212> DNA 

* 2-1.3 > Aarti f iciai Sequence 
■<22G> 

<523> olipoiiLLclpQticlQ 
-=22£> 

<221> miec_£»a.tvir* 

<;222:> <2i 

<223> biOtinyLated plqesphaaEamltfifce rrt&idue 

ft«tfflratefc$r4 ^catggactg aaBacacra^ 2 9 

<211*- 29 

<2L3> Artificial SBCJUOOflCe 
-=:22.0* 

<2 22> oiU.gocHieileoti.fit; 

<220> 
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<223> bio tiny la ted P^OBpfiOd^-ajTiitf} tc rcsiaa* 
<4D0> 30 

iixatagacca a*tgag?igg-£ Ltffaca-cto 

<212> PR? 
<40D> 31 

Jfet Mat gly GfLci Val Gly Ly* Aorg i* U Leu llfi GIn Gly ^ 

OLu Hsu gly GlM L*U Leu Tht Ly E dy* Qly j^n GlLL Aap Ph* L^u Lya 

2D 25- ^ 

Hat dly Ala Ely Txp lie GlA Ala s H r ^ Fha AER Fr& Gly Lc^ Thr 
35 4Ci d5 

L«j Ele ain £sr Lya -TV* Pne- JJ=u feer s** Ser Trp Giy Gly Val Lqu 
50 55 6Q 

Gly 01* Ly» Lyg <3ly Aa» Dy= JUa Ttw He pro 5 La Lya Glu 
65 ™ 75 8 * 

Lyfi Tyr Lea Uia V*L Leu 

05 



29 



<210> 32= 
<211> 8? 
<212> PftT 

<4Q0> 35 

Met Met Phc il B v^l Pro Val Vol Phc uiq pi^ Ars Sa r Arg lie &« 

^ PK * S«r Phe life Ax^ Val Mat Ly* L*U Aaij Pr& 3w AL B 
20 25 3D 

Met Ser Ai a Leu &lu Cy* v&i Tyr-Sar Gin Try. C /K tot 

3* 40 4& 

TVr II a Lau Cy» L QU Aifr Sir Lea Arg 3er V Ql Ser Glfc Aon Asm 
50 *5 6-D 

Sar Gflj Lgu Leu Hla Lye Acn ph* ji* eye He ivr aiu A«= P A Sp sea; 
* & ™ 75 ao 
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